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Preface 


This second volume of a comprehensive edition of the works of Kurt 
Godel contains the remainder of his published work, covering the period 
1938-1974. (Volume I included all of his publications from 1929 to 1936; 
no work of his appeared during 1937.) Succeeding volumes are to contain 
selections from Godel’s unpublished manuscripts, lectures, lecture notes 
and correspondence, as well as extracts from his scientific notebooks. 

For a detailed explanation of the plan for this edition, the reader should 
consult the Preface to Volume I of th^se Works. To summarize briefly, 
each article or closely related group of articles is preceded by an introduc¬ 
tory note that elucidates it and places it in historical context. These notes 
(varying greatly in length) have been written by the members of the edi¬ 
torial board as well as a number of outside experts. Furthermore, the only 
article originally written in German, 1958, is accompanied by an English 
translation on facing pages. As in Volume I, the original articles have been 
typeset anew in a uniform and more readable format. Finally, the exten¬ 
sive list of references in this volume contains all items referred to either by 
Godel or in the introductory notes, and also includes all the items listed in 
Volume I. 

Here again, our aim has been to make the full body of Godel’s work 
as accessible and useful to as wide an audience as possible, without in 
any way sacrificing the requirements of historical and scientific accuracy. 
We expect these volumes to be of interest and value to professionals and 
students in the areas of logic, mathematics, computer science and even 
physics, as well as to many non-specialist readers with a broad scientific 
background. Naturally, even with the assistance of the introductory notes, 
not all of Godel’s work can be made equally accessible to such a variety 
of readers; but the general reader should nonetheless be able to gain some 
appreciation of what Godel accomplished in each case. 

We continue to be indebted to the National Science Foundation and 
the Sloan Foundation, whose grants have made possible the production of 
Volumes I and II as well as preparations for succeeding volumes, and to 
the Association for Symbolic Logic, which has sponsored our project and 
administered these grants. Our publisher, Oxford University Press, has 
once more been very accommodating to both our overall plans and our 
specific wishes. Much of our work was done with the assistance of former 
Oxford Science Editor Donald Degenhardt; following his return to England, 
we have received the assistance first of Jeffrey W. House, Vice-President 
and Executive Editor for Science and Medicine, and more recently of the 
new Science Editor, Dr. Jacqueline E. Hartt. 
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For the names of the many other individuals who have helped make these 
first two volumes of Godel’s Collected works possible, the reader should 
refer to the Preface to Volume I. Our gratitude to all for their assistance 
is undiminished. 

Solomon Feferman 


Jean van Heijenoort, our dear friend and co-editor, died on 29 
March 1986. His contributions to our work on Kurt Godel were 
invaluable at every stage and in every respect. At the outset, 
his enthusiastic support was instrumental in our decision to em¬ 
bark upon this project. Then, drawing upon his own extensive 
editorial experience, he helped us to develop our overall plans 
as well as to make the many detailed choices, and throughout 
the course of the work he devoted himself unstintingly and with 
the utmost care to whatever task was at hand. 

The present volume was largely completed by the time of 
van Heijenoort’s death; indeed, he had already begun a detailed 
examination of some of Godel’s unpublished articles for the suc¬ 
ceeding volume. His spirit will continue to animate all our work, 
and we have taken his standards as our own. 
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Introductory notes. The purpose of the notes described in the Pref¬ 
ace above is (i) to provide a historical context for the items introduced, 
(ii) to explain their contents to a greater or lesser extent, (iii) to discuss 
further developments which resulted from them and (iv) in some cases to 
give a critical analysis. Each note was read in draft form by the editorial 
board, and then modified by the respective authors in response to criti¬ 
cisms and suggestions, the procedure being repeated as often as necessary 
in the case of very substantial notes. No attempt, however, has been made 
to impose uniformity of style, point of view, or even length. While the 
editorial board actively engaged in a critical and advisory capacity in the 
preparation of each note and made the final decision as to its acceptabil¬ 
ity, primary credit and responsibility for the notes rest with the individual 
authors. 

Introductory notes are distinguished typographically by a running ver¬ 
tical line along the left- or right-hand margin and are boxed off at their 
end. a The authorship of each note is given in the Contents and at the end 
of the note itself. 

References. Each volume contains a comprehensive References section 
which comprises the following three categories of items: (i) a complete 
bibliography of Godel’s own published work, (ii) all items referred to by 
Godel in his publications and (iii) all items referred to in the chapter in 
Volume I on Godel’s life and work or in the individual introductory notes. 

In the list of references, each item is assigned a date with or without 
a letter suffix, e.g., “1930”, “1930a”, “1930b”, etc. b The date is that of 
publication, where there is a published copy, or of presentation, for unpub¬ 
lished items such as a speech. A suffix is used when there is more than 
one publication in that year. (The ordering of suffixes does not necessarily 
correspond to order of publication within any given year.) Date of compo¬ 
sition has not been used for references, since that is frequently unavailable 
or only loosely determined. 

Within the text of our volumes, all references are supplied by citing 
author(s) and date in italics, e.g., Godel 1930 or Hilbert and Bernays 1934- 
Where no name is specified or determined by the context, the reference is 
to Godel’s bibliography, as e.g., in “Introductory note to 1929, 1930 and 


a A special situation occurs when the note ends in mid-page before facing German 
and English text. Then the note extends across the top half of the facing pages and is 
boxed off accordingly. 

b “198?” is used for articles whose date of publication is to be in 1988 or later, or is 
not yet known. 
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1930a". Examples of the use of a name to set the context for a reference 
are: “Frege’s formal system presented in 1879”, “Skolem proved in 1920 
that ..and “Skolem (1920) proved that .. 

There are two works by Godel, 1929 and 1972, whose dating required 
special consideration; they appear in Volumes I and II, respectively. The 
first of these is Godel’s dissertation at the University of Vienna; its date is 
that of the year in which the dissertation was submitted (as distinguished 
from the date of its acceptance, which was 1930). The second, a transla¬ 
tion and revision of Godel’s paper 1958, was intended for publication in 
the journal Dialectica but hitherto never actually appeared. It reached the 
stage of page proofs and was found in that form in Godel’s Nachlass. Corre¬ 
spondence surrounding this projected publication shows that Godel worked 
on the revision sporadically over a number of years, beginning in 1965; the 
last date for which we have evidence of his making specific changes is 1972, 
and that date has therefore been assigned to it in our References. (For 
more information concerning this work, see the introductory note to 1958 
and 1972 in this volume.) Appended to the page proofs of 1972 were three 
short notes on the incompleteness results; they have been assigned the date 
1972a in the References. 

To make the References as useful as possible for historical purposes, au¬ 
thors’ names are there supplied with first and/or middle names as well as 
initials, except when the information could not be determined. Russian 
names are given both in transliterated form and in their original Cyril¬ 
lic spelling. In some cases, common variant transliterations of the same 
author’s name, attached to different publications, are also noted. 

Editorial annotations and textual notes. Editorial annotations within 
any of the original texts or their translations or within items quoted from 
other authors are signaled by double square brackets: |[ ]. Single square 
brackets [ ] are used to incorporate corrections supplied by Godel. In 
some articles, editorial footnotes are inserted in double square brackets 
for a further level of annotation. Each volume has, in addition, a sepa¬ 
rate list of textual notes in which other corrections are supplied. Finally, 
the following kinds of changes are made uniformly in the original texts: 
(i) footnote numbers are raised above the line as simple numerals, e.g., 2 
instead of ; (ii) spacing used for emphasis in the original German is here 
replaced by italics, e.g., erfiillbaris replaced by erfullbar; (iii) refer¬ 
ences are replaced by author(s) and date, as explained above; (iv) initial 
sub-quotes in German are raised, e.g., „engeren” becomes “engeren”. 

Translations. The overall aim for the translations, as well as the variety 
of work required and general responsibility for them has been described in 
the Preface to Volume I. The only translation in this volume (namely that 
of 1958) is the work of Stefan Bauer-Mengelberg and Jean van Heijenoort. 

Logical symbols. The logical symbols used in Godel’s original articles 
are here presented intact, even though these symbols may vary from one ar- 
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tide to another. Authors of introductory notes have in some cases followed 
the notation of the article(s) discussed and in other cases have preferred 
to make use of other, more current, notation. Finally, logical symbols are 
sometimes used to abbreviate informal expressions as well as formal opera¬ 
tions. No attempt has been made to impose uniformity in this respect. As 
an aid to the reader, we provide the following glossary of the symbols that 
are used in one way or another in these volumes, where ‘A’, L B’ are letters 
for propositions or formulas and ‘A(x)’ is a propositional function of x or 
a formula with free variable l x'. 

Conjunction (“A and B ”): A. B, A A B, A & B 
Disjunction (“A or JET): A V B 
Negation (“not A”): A, ~A, ->A 

Conditional, or Implication (“if A then B ”): A D B, A B 
Biconditional (“A if and only if B ”): A DC B, A = B, A ~ B, A *-> B 
Universal quantification (“for all x , A(x)”): (x)A(x), IIxA(x), xIl(A(x)), 
(Vx)A(x) 

Existential quantification (“there exists an x such that A(x)”): (Ex)A(x), 
ExA(x), (3x)A(x) 

Unicity quantification (“there exists a unique x such that A(x)”): 

( E\x)A(x ), X!xA(x), (3!x)A(x) 

Necessity operator (“A is necessary”): OA, NA 

Minimum operator (“the least x such that A(x)”): ex(A(x)), px{A(xj) 
Provability relation (“A is provable in the system S”): S h A 
Note: (i) The “horseshoe” symbol is also used for set-inclusion, i.e., for 
sets X, Y one writes X C Y (or Y D A) to express that A is a subset 
of Y. (ii) Dots are sometimes used in lieu of parentheses, e.g., Ad.B D A 
is written for A D (B D A). 

Typesetting. These volumes have been prepared by the TgX comput¬ 
erized mathematical typesetting system (devised by Donald E. Knuth of 
Stanford University), as described in the Preface to Volume I. The result¬ 
ing camera-ready copy was delivered to the publisher for printing. The 
computerized system was employed because: (i) much material, including 
the introductory notes and translations, needed to undergo several revi¬ 
sions; (ii) proof-reading was carried on as the project proceeded; (iii) the 
papers could be prepared in a uniform, very readable form, instead of 
being photographed from the original articles. Choices of the various type¬ 
setting parameters were made by the editors in consultation with the pub¬ 
lisher. Primary responsibility for preparing copy for the typesetting system 
lay with Gregory H. Moore, and the typesetting itself was carried out by 
Yasuko Kitajima. 

For all previously published articles, original pagination is indicated 
herein by numbers in the margins, with vertical bars in the body of the 
text used to show the exact page breaks. No page bar or number is used 
to indicate the initial page of an article. 
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Footnotes. We use a combination of numbering and lettering, as fol¬ 
lows. All footnotes for Godel’s texts and their translations are numbered, 
with only rare exceptions, as in the original. There is, however, one special 
case, that of 1972, in which Godel provided a second series of footnotes, 
essentially to preserve the original series from 1958 without change of num¬ 
bering. The new series is here distinguished by boldface lower-case Roman 
letters. For all the other material in this volume, footnotes are indicated 
by lightface lower-case Roman letters. 

Godel’s Nachlass. The scientific Nachlass of Kurt Godel was donated to 
the Institute for Advanced Study in Princeton, N.J., by his widow Adele 
shortly after his death. The Nachlass consists of unpublished manuscripts, 
lecture notes, course notes, notebooks, memoranda, correspondence and 
books from Godel’s library. It was catalogued at the Institute for Ad¬ 
vanced Study during the years 1982-1984 by John Dawson. Early in 
1985 the Nachlass with its catalogue was placed on indefinite loan to the 
Manuscripts Division (located in the Rare Book Room) of the Firestone Li¬ 
brary at Princeton University, where the material is available for scholarly 
examination. All rights for use still reside, however, with the Institute for 
Advanced Study. Though the Nachlass is referred to only here and there 
in Volumes I and II, it will be the source of almost all the material in sub¬ 
sequent volumes. For further information concerning its general character, 
see Volume I, pages 26-28. 

Photographs. Primary responsibility for securing these lay with John 
Dawson. Their various individual sources are credited in the Permissions 
section, which follows directly. 



Copyright permissions 


The editors are grateful to the following copyright holders, who have 
granted permission to reprint in this volume the works by Godel listed 
below. (See the References for full bibliographic citations.) 

The American Mathematical Society, for the following articles: 

“The consistency of the generalized continuum hypothesis” (1939), 
reprinted from Bulletin of the American Mathematical Society, vol. 
45, p. 93, by permission of the American Mathematical Society. 

“Rotating universes in general relativity theory” (1952), reprinted 
from Proceedings of the International Congress of Mathematicians, 
Cambridge, Massachusetts, 1950, pp. 175-181, by permission of the 
American Mathematical Society. 

Postscript to “Provably recursive functionals of analysis: A con¬ 
sistency proof of analysis by an extension of principles formulated 
in current intuitionistic mathematics”, by Clifford Spector (1962), 
reprinted from Recursive function theory, Proceedings of symposia 
in pure mathematics, vol. 5, p. 27, by permission of the American 
Mathematical Society. 

Professors Paul Benacerraf and Hilary Putnam, for the revised version 
(1964) of the article “What is Cantor’s continuum problem?”, published 
in their anthology Philosophy of mathematics: Selected readings, Copyright 
© 1964 by Prentice-Hall, Inc., Englewood Cliffs, New Jersey, and Basil 
Blackwell, Oxford. 

Dialectica, for the article “Uber eine bisher noch nicht beniitzte Erweiterung 
des finiten Standpunktes” (1958) and for permission to prepare and publish 
an English translation of it. 

The Institute for Advanced Study, Princeton, literary executors of the es¬ 
tate of Kurt Godel, for the following articles, in which copyright was re¬ 
tained by Godel: 

“Remarks before the Princeton bicentennial conference on problems 
in mathematics” (1946) 

“On an extension of Unitary mathematics which has not yet been 
used” (1972) 

“Some remarks on the undecidability results” (1972a). 



Copyright permissions 


Part of 1972a appeared earlier in From mathematics to philosophy , by Hao 
Wang, Copyright © 1974 by Routledge and Kegan Paul, London, and 
Humanities Press, New York. The editors thank Professor Wang for his 
permission to include that material in this volume. 

North-Holland Publishing Company, Amsterdam, for Godel’s untitled re¬ 
marks 1974, extracted from the preface to the second edition of Non¬ 
standard analysis , by Abraham Robinson. 

Open Court Publishing Company, La Salle, Illinois, for the following arti¬ 
cles: 

“Russell’s mathematical logic” (1944)- Reprinted from The philoso¬ 
phy of Bertrand Russell, by Paul A. Schilpp (ed.), by permission of 
the Open Court Publishing Company, La Salle, Illinois. Copyright 
1944, 1951 and © 1970 by The Library of Living Philosophers, Inc. 

“A remark about the relationship between relativity theory and ide¬ 
alistic philosophy” (1949a). Reprinted from Albert Einstein, philoso¬ 
pher-scientist, by Paul A. Schilpp (ed.), by permission of the Open 
Court Publishing Company, La Salle, Illinois. Copyright 1949, 1951 
and © 1970 by The Library of Living Philosophers, Inc. 

Princeton University Press, for the monograph The consistency of the con¬ 
tinuum hypothesis, Annals of Mathematics Studies No. 3. Copyright 1940 
by Princeton University Press, © renewed 1968 by Princeton University 
Press. Reprinted by permission of Princeton University Press. 

In addition, the editors thank the following for consenting to the repro¬ 
duction in this volume of the photographs listed below: 

Professor Richard Arens, of the University of California at Los Angeles, for 
his portrait of Kurt Godel and Albert Einstein; 

The Institute for Advanced Study, Princeton, for the portrait of Kurt Godel 
and Alfred Tarski, found in Godel’s Nachlass ; 

The Seeley G. Mudd Manuscript Library, Princeton University, for the 
group photograph of participants in the Princeton bicentennial conference 
on problems in mathematics; 

Mr. Arnold Newman, New York City, for his portrait of Kurt Godel; 

Dr. Veli Valpola, Espoo, Finland, for the portrait of Kurt Godel in his 
office. 



Contents 


Volume II 

List of illustrations.xi 

Godel 1938: Introductory note to 1938, 1939, 1939a and 1940, 

by Robert M. Solovay. 1 

The consistency of the axiom of choice and of the generalized 

continuum hypothesis.26 

Godel 1939: (See introductory note under Godel 1938.) 

The consistency of the generalized continuum hypothesis .... 27 

Godel 1939a: (See introductory note under Godel 1938.) 

Consistency proof for the generalized continuum hypothesis ... 28 
Godel 1940: (See introductory note under Godel 1938.) 

The consistency of the axiom of choice and of the generalized 

continuum hypothesis with the axioms of set theory .33 

Godel 1944 : Introductory note to 1944, by Charles Parsons . . . 102 

Russell’s mathematical logic. 119 

Godel 1940: Introductory note to 1946, by Charles Parsons . . . 144 

Remarks before the Princeton bicentennial conference 

on problems in mathematics. 150 

Gidel 1947: Introductory note to 1947 and 1964, 

by Gregory H. Moore. 154 

What is Cantor’s continuum problem?. 176 

Godel 1949: Introductory note to 1949 and 1952, 

by S. W. Hawking. 189 

An example of a new type of cosmological solutions 

of Einstein’s field equations of gravitation. 190 

Godel 1949a: Introductory note to 1949a, by Howard Stein . . . 199 

A remark about the relationship between relativity theory 

and idealistic philosophy. 202 

Godel 1952: (See introductory note under Godel 1949.) 

Rotating universes in general relativity theory. 208 














Contents 


xiv 

Godel 1958: Introductory note to 1958 and 1972, 

by A.S. Troelstra. 217 

Uber eine bisher noch nicht beniitzte Erweiterung 

des finiten Standpunktes. 240 

On a hitherto unutilized extension of the finitary standpoint . . 241 

Godel 1962: Postscript to Spector 1962 . 253 

Godel 1964 '- (See introductory note under Godel 1947.) 

What is Cantor’s continuum problem?. 254 

Godel 1972: (See introductory note under Godel 1958.) 

On an extension of finitary mathematics which has not 

yet been used. 271 

Godel 1972a: Introductory note to 1972a, by Solomon Feferman, 

Robert M. Solovay and Judson C. Webb. 281 

Some remarks on the undecidability results. 305 

Godel 1974'- Introductory note to 1974, by Jens Erik Fenstad . . 307 

[Remark on non-standard analysis]. 311 

Textual notes. 313 

References. 327 

Addenda and corrigenda to Volume I. 389 

Index. 393 















List of illustrations 


Portrait of Kurt Godel, 1956, by Arnold Newman.frontispiece 

Participants in the Princeton bicentennial conference 

on problems in mathematics, December 1946 . 142 

Key to conference participants. 143 

Albert Einstein and Kurt Godel at the Institute 

for Advanced Study, about 1954 . 188 

Kurt Godel in his office at the Institute for 

Advanced Study, May 1958 . 218 

Kurt Godel and Alfred Tarski, March 1962 . 252 









This page intentionally left blank 



Kurt Godel 

COLLECTED WORKS 
Volume II 



This page intentionally left blank 



Introductory note to 1938 , 1939 , 
1939a and 1940 


1. Introduction 

The papers discussed in this introductory note deal with Godel’s proof 
of the relative consistency of the axiom of choice and of the generalized 
continuum hypothesis with the usual axioms for set theory. a The note 
1938 announces the results and gives a brief discussion of the ideas un¬ 
derlying their proofs; 1939 is an abstract in which the results are again 
announced. The paper 1939a gives considerable technical detail con¬ 
cerning the proofs; essentially complete proofs, along somewhat different 
lines, are presented in 1940. 

In the next section, I shall describe briefly the historical context of 
Godel’s work. b The third section is devoted to the results themselves, 
the problems that they solved, and the methods used to obtain them. 
In the final section, the new problems raised by Godel’s work and the 
results concerning them obtained by later workers will be discussed. 

2. Historical antecedents 

Godel’s work bore on two previously considered questions, the axiom 
of choice and the continuum hypothesis. 

2.1 The axiom of choice 

Cantor had conjectured the proposition, now called the well-ordering 
theorem, that every set can be well-ordered. 0 In 1904 Zermelo gave a 
proof of this conjecture, using in an essential way the following mathe¬ 
matical principle: for every set X there is a choice function, f, which 
is defined on the collection of non-empty subsets of X, such that for 
every set A in its domain we have f(A) e A. Subsequently, in 1908, 
Zermelo presented an axiomatic version of set theory in which his proof 
of the well-ordering theorem could be carried out. One of the axioms 


a This is here taken to be the Zermelo-Praenkel (ZF) system of axioms for set 
theory excluding the axiom of choice, explained in Section 2 below. 

b I have been greatly helped in preparing Section 2 by conversations and corre¬ 
spondence with Gregory H. Moore. 

C A well-ordering of a set X is a linear ordering of X such that every non-empty 
subset of X has a least element. 
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was the principle just stated, d which Zermelo referred to as the “Axiom 
der Auswahl” (the axiom of choice, abbreviated AC). 

Zermelo’s proof was the subject of considerable controversy. 6 The 
well-ordering theorem is quite remarkable, since, for example, there is 
no obvious way to define a well-ordering of the set of real numbers. f 
Nor is such an explicit well-ordering provided by Zermelo’s proof. Thus 
many people who thought Zermelo’s result implausible cast doubt upon 
the validity of AC. The other set-existence axioms all have the form 
that some collection of sets, explicitly definable from certain given pa¬ 
rameters, is itself a set. The axiom of choice, on the other hand, asserts 
the existence of a choice function but does not provide an explicit defi¬ 
nition of such a choice function. Zermelo was well aware that his axiom 
had this purely existential character, but many other mathematicians 
were uncomfortable with existence proofs that did not provide the con¬ 
struction of specific examples of what was asserted to exist. 

The work of Godel dealt with here showed that AC is “safe” in the 
following sense: If the usual axioms of set theory (including the axiom 
of foundation but excluding AC) do not lead to a contradiction, then 
they remain consistent when AC is adjoined as an additional axiom. 

2.2 The continuum hypothesis 

In his theory of infinite cardinals Cantor proved (making essential but 
implicit use of AC) that the totality of all infinite cardinal numbers is 
well-ordered (and in fact is order-isomorphic to the totality of all ordinal 
numbers). However, an important question was left open by Cantor’s 
work. Let c be the cardinal number of the set of real numbers (or, as this 
set is sometimes referred to, the continuum). Cantor showed that c is 
not the first infinite cardinal, but he was unable to determine its precise 
place in the hierarchy of infinite cardinals. He conjectured, however, that 
c is precisely equal to Ni, the second infinite cardinal ( Cantor 1883). 
This conjecture became known as the continuum hypothesis ( CH ). It 
is easily shown that c = 2 H °, and so CH is equivalent to the statement 
2 n ° = Ni- A natural generalization, considered later by Hausdorff (1908) 
and called the generalized continuum hypothesis (GCH ), asserts that for 
every ordinal a, 2 Ha = K q+ i. 


d Zermelo’s precise formulation in his Axiom VI was slightly different from the 
one we have given here, but they are easily proved equivalent in the presence of the 
other axioms. 

e Moore 1982 has an exhaustive discussion of the history of this controversy. 
f We now know, thanks to the work of Cohen (1963, 1964) and Feferman (1965), 
that it is consistent with all the usual axioms of set theory (including the axioms 
of choice and foundation) that there is no definable well-ordering of the set of real 
numbers. 
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Gddel did not succeed in settling whether or not CH is true. But he 
was able to show that the usual axioms of set theory do not disprove 
CH, so that if they settle its truth value at all, it must be a theorem. 
As it turned out, Cohen was able to show in 1963 that the latter also 
does not hold. Godel’s method of proof for the consistency of CH with 
the axioms of set theory (including the axiom of choice), to be described 
below, worked just as well to establish the consistency of GCH with 
those axioms. 

2.3 Axiomatizations of set theory 

Modern axiomatizations of set theory are all derived from Zermelo 
1908. However, the systems of Skolem and Fraenkel that have come to 
replace Zermelo’s differ from it in several respects: 8 

(a) Zermelo’s original axioms allowed for individuals (or urelements) 
that are members of sets but are not sets themselves. Thus, Zermelo’s 
version of the axiom of extensionality said that two sets with precisely 
the same members are equal. The modern axiomatization considers only 
pure sets; thus the variables of the theory range only over sets, and the 
axiom of extensionality takes the form that whenever x and y have the 
same members, they are equal. 

(b) A key axiom schema of set theory, separation, expresses the fol¬ 
lowing: if a: is a set and P is a property, then there is a set y whose 
members are precisely those members of x that have the property P. 
Clearly some care is needed in the precise formulation of this axiom 
schema. Zermelo introduced a new undefined notion of “definite prop¬ 
erty” . (He says “A question or assertion ... is said to be definite if the 
fundamental relations of the domain, by means of the axioms and the 
universally valid laws of logic, determine without arbitrariness whether 
it holds or not”.* 1 ) He then required that the property P be definite in 
this sense. 

While perhaps adequate for mathematical practice, Zermelo’s treat¬ 
ment of this axiom schema was not precise enough for metamathemati- 
cal investigations. For these purposes, one needs a precise set of axioms 
in an unambiguously defined formal language. The solution (found by 
Skolem in his 1923) is to allow only those properties P that are ex¬ 
pressible in the appropriate formal language for set theory. 1 Present-day 


^Section 4.9 of Moore 1982 has an excellent discussion of the historical process 
by which ZF evolved from the axiomatic theory of Zermelo 1908. 
h I am quoting from the translation in van Heijenoort 1967, p. 201. 

‘Of course, the idea of a formal first-order language was not as familiar in 1923 
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versions of axiomatic set theory handle the separation schema in the 
manner of Skolem. 

(c) Even after Zermelo 1908 is amended as suggested in (a), it still 
permits the existence of anomalous sets. For example, it is possible to 
have a model of Zermelo’s theory in which there is a set x whose sole 
member is x itself. This and other related anomalies are ruled out by 
the axiom of foundation, which asserts that every non-empty set x has 
a member y which has no members in common with x. 

Another formulation of the axiom of foundation with a clearer con¬ 
ceptual meaning will be given after we discuss the replacement axiom. 

(d) Both Fraenkel and Skolem pointed out that Zermelo’s system of 
axioms could not carry out certain constructions permitted in Cantor’s 
“naive set theory”. For example, let Zq be the set of non-negative in¬ 
tegers and let Zi +1 be the power set of Z, for few. Then Zermelo’s 
system is unable to prove the existence of the set whose members are 
the Zi s. 

This defect is handled by adjoining a new axiom schema, that of re¬ 
placement, , which asserts roughly the following: let P(z) be a “definite 
property” in the precise sense above. Suppose that, for every set x, there 
is precisely one y such that P{{x, y))) We can think of P as determining 
a function F whose domain is the collection of all sets. Then, for every 
set a, there is a set b whose members are precisely the values of F(y ) for 
V e a. 

The modern axiomatization of set theory, ZFC, is obtained by mak¬ 
ing these four changes in Zermelo’s 1908 paper. (The version without 
the axiom of choice is denoted ZF.) The system obtained from ZFC 
(respectively, ZF) by dropping the axiom schema for replacement is de¬ 
noted ZC (respectively, Z). 

We'can now describe the more conceptual proposition which (in the 
presence of all the axioms of ZF except the axiom of foundation) is 
equivalent to the axiom of foundation. (The proof makes essential use 
of the replacement schema and cannot be carried out in Z or even ZC.) 

The levels in the cumulative hierarchy are the sets R(a), defined for 
all ordinals a by transfinite induction on a as follows: 

(i) R( 0) = 0; 

(ii) if a = /3 + 1, then R(a) = P(R(j3)) (where P(a;), the power set of 
x , is the collection of all subsets of x)\ 

(iii) if cc is a limit ordinal, then R(a) = (J R( 7 ). 

7<a 

The promised equivalent to the axiom of foundation is the proposition 
that every set is a member of one of the levels R(a ) of the cumulative 
hierarchy. 


JHere (x,y) is the ordered pair of x and y, defined in the usual way due to 
Kuratowski (1921). 
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Present day research in set theory concentrates on the theory ZFC. 
However, in 1940 Godel worked with a different version of axiomatic set 
theory, due to Bernays and Godel, which he denoted by £ but which is 
now customarily denoted BG. k In the system BG there are two differ¬ 
ent sorts of variables. First of all there are variables which range over 
sets. Intuitively, these sets may be identified with the sets which are 
the range of the variables of ZFC. In addition, there are class variables. 
The intuitive picture is that classes are collections of sets. The axioms 
of BG ensure that every set is a class, that every member of a class 
is a set and conversely, and that two classes which have precisely the 
same members are equal; but in addition to sets, BG provides for the 
existence of classes, called proper classes, which are “too large” to be 
sets. For example, there is a universal class, V, which has every set as 
a member. 

While every theorem of ZF is a theorem of BG, a striking difference 
is that BG is finitely axiomatizable while ZF is not. The reason why ZF 
requires infinitely many axioms is that each of the two axiom schemas 
(of separation and replacement) has infinitely many instances. Each of 
these schemas corresponds to a single axiom of BG. (For example, the 
axiom schema of separation corresponds to the assertion that, if A is a 
class and a; is a set, there is a set y whose members are precisely the 
sets which are members of both A and x.) BG has only finitely many 
axioms of class existence, but they suffice to prove that, for any prop¬ 
erty P of sets which is expressible in the language of ZF (with particular 
sets allowed as parameters in the definition), BG can prove that there 
is a class whose members are precisely the sets with property P. 1 (The 
finitely many axioms of BG that are needed correspond roughly to the 
finitely many basic predicates and logical connectives of the language of 
ZF.) 

To state the next significant fact about the Bernays-Godel system, 
we agree that henceforth BG will refer to the version without the axiom 
of choice, while BGC will refer to the version with the “global axiom of 
choice”. This principle asserts that there is a single function F (neces¬ 
sarily a proper class) which selects a member from every non-empty set. 
Mostowski proved that if ^ is a sentence in the language of ZF, then <j> 
is a theorem of ZF if and only if it is a theorem of BG; in other words, 
BG is a conservative extension of ZF. m It turns out that BGC is also 


k The system BG grew out of earlier work by von Neumann. Cf. von Neumann 
1925, 1928, Bernays 1937, 1941, 1942, and Godel 1940. 

'This and more is proved in the metatheorem Ml of Chapter II of 1940. 
m Mostowski’s proof is sketched in footnote 6 on page 112 of Mostowski 1950. 
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a conservative extension of ZFC. This latter result was proved at about 
the same time by Kripke, Cohen and the present writer." 

Finally, we remark that natural models of the different theories we 
have discussed can be found at suitable stages of the cumulative hi¬ 
erarchy. (For all the models so considered, we take the c-relation of 
the model to be the restriction of the usual membership relation to its 
sets.) In particular, if a is a limit ordinal greater than w (for example, 
a = + uj), then R(a) is a model of ZC. 

Before stating the corresponding result for natural models of ZFC, we 
need to recall some notions about infinite cardinals. An infinite cardinal 
k is regular if k is not the sum of fewer than n many cardinals less than 
k. An infinite cardinal k is a strong limit cardinal if, whenever A is a 
cardinal less than k, then also 2 X < k. Finally, an infinite cardinal k is 
strongly inaccessible if it is regular, is greater than N 0 > and is a strong 
limit cardinal. If is a strongly inaccessible cardinal, then R(Q) is a 
model of ZFC. We can obtain a model of BGC by taking the sets of the 
model to be the members of f?(f2) and the classes of the model to be the 
subsets of R(Q,).° 

There is another construction of “natural models of set theory” we 
shall need to refer to later, for which some preliminary definitions are 
required. A set x is said to be transitive if (Vt/ e x)(y C x). One can 
prove in ZFC that for every set x there is a smallest transitive set y such 
that x C y. This y is called the transitive closure of x. For A an infinite 
cardinal, H( A) is the collection of all sets x whose transitive closure has 
cardinality less than A. (One can prove in ZFC that H( A) is a set.) If 
A is a regular cardinal greater than H 0 , then H( A) is a model of all the 
axioms of ZFC except possibly the power set axiom. If A is a strong 
limit cardinal greater than K 0 , then H(X) is a model of ZC. Finally, if A 
is strongly inaccessible, then H{ A) is a model of ZFC. (In this last case, 
one can show that H( A) = R( A).) 

This completes our review of the different versions of axiomatic set 
theory and their simplest models; we now return to our discussion of the 
historical antecedents of Godel’s work on constructibility. 

Godel’s method, which has subsequently become known as the “inner 
model” method, proceeded as follows: he described a certain collection 
of sets, called the constructible sets, and was able to prove (in axiomatic 
set theory without the axiom of choice) that each of the axioms of set 
theory holds in the domain of constructible sets. He also showed that 


"Subsequently, the proof was rediscovered and published in Feigner 1971. 

“Since the results just stated can be proved in ZFC, it follows by Godel’s sec¬ 
ond incompleteness theorem that the existence of an inaccessible cardinal cannot be 
proved in ZFC if ZFC is consistent. 
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AC and GCH hold in this domain. From this it follows easily that if 
the axioms of set theory became inconsistent after adjoining AC and 
GCH, then they must already have been inconsistent without these new 
axioms. For a proof of 0 = 1 from the new axioms could be recast as a 
proof that 0=1 holds in the domain of all constructible sets, an obvious 
contradiction. 

A certain amount of model-theoretic work had been done in set theory 
prior to Godel’s work. When Zermelo presented his axiomatization of set 
theory in 1908, he already raised the question of the system’s consistency 
(“I have not yet even been able to prove rigorously that my axioms are 
consistent, though this is certainly very essential.. .” p ); he also remarked 
that his axioms appeared to be independent of each other, but made no 
attempt at a proof. The first real endeavors to work with models of set 
theory were by Fraenkel (1922, 1922a) and Skolem (1923). Fraenkel 
attempted* 1 to show the independence of a number of Zermelo’s axioms, 
particularly the axioms of choice and separation. However, Fraenkel 
was definitely not thinking in terms of first-order logic, whereas Skolem 
was. Naturally, this had profound effects on the sort of models that they 
considered. Skolem was very interested in countable models of set theory 
and noted that such a model, even if its natural numbers are standard, 
will omit some set of natural numbers. He raised the question whether 
one can add such a set of natural numbers and still have a model of 
Zermelo set theory. Also, he argued that the continuum hypothesis is 
probably neither proved nor disproved by Zermelo’s axioms. 

There matters sat for a while. Then von Neumann in his 1929 gave 
the first relative consistency proof, that for the axiom of foundation; 
that is, he showed that if set theory without the axiom of foundation is 
consistent, then it remains consistent when the axiom is added. Later, 
Ackermann (1937) gave a proof that if number theory is consistent, so is 
ZFC minus the axiom of infinity. Godel’s discovery of constructible sets 
and their use in proving the relative consistency of the axiom of choice 
dates from 1935; the proof of the relative consistency of CH (and, in 
fact, of GCH) came later, apparently in 1937. r 


p This passage is quoted from the translation in van Heijenoort 1967, pp. 200-201 
Of course, in light of Godel’s second incompleteness theorem, it is unreasonable to 
hope for a proof of the consistency of the Zermelo axioms using means formalizable 
within Zermelo’s system. 

q Fraenkel’s work did not meet modern standards of rigor. Completely adequate 
versions of Fraenkel’s proofs were given in Lindenbaum and Mostowski 1938 ; see 
Mostowski 1939. 

r For further details on the evolution of Godel’s proof and other references, see p. 
158 below as well as these Works, Volume I, pp. 9, 21-22; the dating 1937 for the 
relative consistency of GCH comes from an item in Godel’s Nachlass (op. cit., p. 36, 
fn. s). Cf. also Moore 1982, pp. 280-283. 
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As we shall see more clearly in the next section, Godel’s constructible 
hierarchy (which is used to define the class of constructible sets) can be 
viewed as a variant of the cumulative rank hierarchy. The rank hierar¬ 
chy had first been clearly stated by Zermelo in his 1930, in the context 
of second-order models of set theory. s In a sense, Godel combined the 
rank hierarchy of Zermelo with the first-order perspective of Skolem in 
order to obtain the hierarchy of constructible sets. 

Another antecedent of Godel’s constructible hierarchy is the rami¬ 
fied theory of types of Russell and Whitehead. Indeed, Godel explic¬ 
itly states that his constructible hierarchy can be viewed as the natural 
prolongation to transfinite levels of the ramified theory of types (1944: 
page 147). The most striking expression of this connection appears in a 
letter of Godel to Hao Wang, dated 7 March 1968 and quoted on page 10 
of Wang 1974- There Godel attests to the fruitfulness of his platonistic 
attitudes for his research in the foundations of mathematics. Referring 
to his work on the consistency of CH, he says, “However, as far as, in 
particular, the continuum hypothesis is concerned, there was a special 
obstacle which really made it practically impossible for constructivists to 
discover my consistency proof. It is the fact that the ramified hierarchy, 
which had been invented expressly for constructive purposes, has to be 
used in an entirely nonconstructive way." The essentially nonconstruc¬ 
tive element lies in the use of arbitrary ordinals as the levels in Godel’s 
extension of the ramified theory. 

3. Description of the proof 

The outline we shall give is substantially that of Godel 1939a. In that 
brief note, the details of the proofs that the axioms of ZFC hold in L 
and of the absoluteness arguments needed to establish that V = L holds 
in L are not given, but all the key notions and ideas are explained. 

The notion of constructible set is best defined in terms of an auxil¬ 
iary hierarchy of sets, the L a ' s, which are defined, for all ordinals a, by 
transfinite induction on a, as follows: 

(i) Lo = 0; 

(ii) if a = (3 + 1, then L a consists of all the subsets of Lp that are 
definable by a first-order formula of set theory, possibly containing pa¬ 
rameters from Lp, when the variables of that formula are interpreted as 
ranging over Lp; 

(iii) if a is a limit ordinal, then L a = U f r 

y<a 


s Mirimanoff { 1917 ) had introduced the cumulative hierarchy, but was not influ¬ 
ential; von Neumann { 1929 ) used it, but in a very confusing way. 
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Finally, a set is constructible if and only if it appears in some L a ; 
while the class L of constructible sets is not in the range of the variables 
of ZF, the property L(x) of being constructible is definable in ZF, in the 
form 3a(x e L a ). 

A number of comments on this definition are in order. First, it should 
be contrasted with the description of the universe of sets in terms of the 
cumulative hierarchy. There the form of the definition is exactly the 
same as the one just given if a is 0 or a limit ordinal, but at the crucial 
successor case, R(a + 1) is the collection of all subsets of R(a). 

Thus the constructible hierarchy is obtained by modifying the usual 
definition of the cumulative hierarchy to be far more parsimonious in 
adding subsets of the collection of sets already defined. Intuitively, at 
stage a + 1 one throws in only those subsets of L a that must appear in 
any possible model of set theory that contains the set L a . (However, in 
contrast to the cumulative hierarchy, some new subsets of L a may first 
appear at stages later than a + 1; indeed, this happens for all infinite 
stages a.) 

We now explain the essentials of Godel’s proof (outlined in 1939a) 
that L, the totality of all constructible sets, is a model for all the axioms 
of ZF together with AC and GCH } 

Even though the variables of ZF range only over sets and not over 
proper classes, it is standard in expositions of ZF set theory to allow lim¬ 
ited reference to proper classes when this is done in such a way that the 
discussion could in principle be expressed solely in terms of sets. Thus, 
for example, the (true) assertion that every ordinal is constructible might 
be expressed by On C L. (Here On is the class of all ordinals.) A more 
detailed discussion of this point can be found in Chapter 1, Section 9, of 
Kunen 1980. Our discussion of absoluteness in the following paragraph 
should be taken in this spirit. 

We first introduce the very important notion of ... ,x n ) be¬ 
ing absolute from a transitive class M to a transitive subclass A r . u The 
formula (j> is absolute from M to N if and only if, whenever xi,...,x n 


‘The word “model” has to be taken with a grain of salt. For each particular 
axiom of ZF, the statement that that axiom holds in the domain of the constructible 
sets can be formulated in the language of ZF and is in fact a theorem of ZF. Since 
we cannot, in any obvious way, formulate in a single sentence of the language of ZF 
the assertion that all the axioms of ZF hold in L, we are not in danger of running 
afoul of Godel’s second incompleteness theorem on the unprovability of consistency. 
At the same time, this understanding of the model-theoretic approach allows one to 
establish the consistency of ZFC + GCH relative to ZF. 

U A class X of sets is called transitive if, whenever a; is a member of X, then x is 
a subset of X. For example, L and the class V of all sets are transitive classes. 
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are members of N, (j>{x i,..., holds in the structure N v if and only 
if it holds in the structure M. 

It is straightforward to verify that the axioms of ZF hold in L. The 
axiom of separation is the hardest to check. Here the key idea is that, by 
the same argument used to prove the reflection principle of set theory," 
there will be many stages L a at which some preassigned formula will 
be absolute from L to L a . So if x e L n , a subset of x defined by a 
formula whose variables are interpreted as ranging over all of L will be 
defined equally well by the same formula with its variables interpreted 
as ranging over L a \ but then this subset will appear in L a +i- 

The remainder of Godel’s argument consists in showing that A C and 
GCH hold in the constructible universe. This is done in two steps. First, 
it is shown that these propositions follow from the proposition that ev¬ 
ery set is constructible, a proposition now customarily referred to as the 
axiom of constructibility and symbolized by the equality V = L. Sec¬ 
ond, it is shown that the proposition V = L holds in the constructible 
universe. 

We take up the second point first. It is natural to think that this 
is a trivial matter, since from the standpoint of L, the universe of 
sets consists precisely of those sets lying in L. But there is a subtle 
difficulty to recognize and deal with. A set x in L might have some 
property, such as being a cardinal number or being a constructible set, 
in V. but the same property, interpreted in L, might not hold of x. 
(This possibility definitely can happen for the property “is not a car¬ 
dinal number”.) In fact, one can show that the L a ’s, when computed 
in L, are exactly the same as the L a ’s when computed in V. In order 
to show this, it is necessary to make a detailed study of those oper¬ 
ations and notions that are absolute from V to L. It turns out that 
the operation of forming the set of all first-order definable subsets of a 
given set is absolute; the operation of forming the full power set is not. 


v That is, with the e-relation interpreted as usual in N, and with the quantifiers 
interpreted as ranging over the elements of N. 

"The reflection principle asserts that for any finite set of formulas 4? (of the 
language of set theory) there are arbitrarily large ordinals a such that each formula 
<p in $ is absolute from V to R(a). The proof of this principle runs roughly as follows: 
We may assume the collection.$ is closed under the taking of subformulas. Assume 
further that in formalizing first-order logic, we have taken the existential quantifier 
as basic (and defined the universal quantifier in terms of it). For each formula <t> of 
4> that begins with an existential quantifier, we introduce a corresponding Skolem 
function, / ? ,. It is easy to verify the following two facts: (1) If an R(a) is closed 
under all the functions then all the formulas in $ are absolute from V to R(a). 
(2) There are unboundedly many ordinals a such that R(a) is closed under all the 
functions f,i, . ((2) is an easy consequence of the replacement axiom schema of ZF.) 

Godel 1939a does not discuss the verification of the axioms of ZF in L. However, 
a very similar use of Skolem functions occurs in the proof of Theorem 2 of 1 939a. 
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One byproduct of this investigation of absoluteness is the following 
intrinsic characterization of L: It is the minimal transitive class-model 
of the axioms of ZF that contains all the ordinals. This characterization 
shows the fundamental nature of the concept of constructible set. 

We now turn to the proof that the axiom of choice and the generalized 
continuum hypothesis follow from the proposition V — L. For AC, this 
is fairly easy. One can define, by induction on a, a well-ordering W a of 
L a . A little care is needed to make sure that, at a limit stage A, the 
union of the WVs for a < A is a well-ordering for L\. But the heart of 
the argument is to see how to go from a well-ordering of L a to one for 
L a+1 . A definition of a set in L a+ 1 may be viewed as a finite sequence of 
symbols, each of which is either an integer or a member of L a . It is easy 
to well-order the totality of such sequences using the given well-ordering 
of L a ; one then well-orders L a+ \ by putting the elements already in L a 
first, arranged in the order W a , and then ordering the new elements in 
the same order as their minimal definitions.* 

The proof that V = L implies GCH is more subtle. The key lemma 
is the following, which appears as Theorem 2 of 1939a : 

Let A be an infinite cardinal, and let x be an arbitrary subset of 

A; then if V — L, x is a member of L\+. 

Here A + is the least cardinal greater than A. It is quite easy to show 
that L x + has cardinality equal to A + . Thus the lemma will imply that 
GCH holds in L. 

The proof of the lemma is analogous to the proof of the downward 
Skolem-Lowenheim theorem. Since V = L, x will appear in some L y , 
and, by increasing 7 if necessary, we may arrange that 7 is greater than 
A and that L 7 is a model of V = L and of some fixed finite subset T 
of the axioms of ZF which is sufficiently large to prove the fundamental 
properties of the L a ' s. If M and N are transitive set-models of T with 
M a subset of N, then the computation of the L Q ’s will be absolute from 
N to M. By the downward Skolem-Lowenheim theorem, we can then 
find an elementary submodel M of L 1 , of cardinality A, containing x, 
A and all the ordinals less than A. M will in general not be transitive, 
but it is e-isomorphic to a transitive model, say N (nowadays called the 
Mostowski collapse of M). 

Now N is a transitive model of the proposition V — L and of the 
finite fragment T of ZF. Absoluteness arguments show that, for ordinals 


x If one just wants to prove the relative consistency of ZFC to ZF, a simpler proof 
can be given using the notion of ordinal-definable set, first introduced by Godel in 
1946\ see the introductory note to 1946 in this volume. 
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6 e N, Lff = L$. Since V — L holds in N, N is just the union of the 
Lg’s for S an ordinal of N; i.e., N = Lq, where 6 is the least ordinal not 
in N. 

It remains to notice two points. First, the isomorphism between M 
and N is the identity on the ordinals < A and hence carries the set x 
to itself. (This is fairly easy to verify from the detailed proof that M is 
isomorphic to a transitive set.) So x lies in N = Lg. Second, the cardi¬ 
nality of 6 is less than or equal to the cardinality of N (or equivalently, 
the cardinality of M), which is less than or equal to A. So 9 is less than 
A + . This completes the sketch of the proof of the crucial lemma. 

Godel also notes that for A = K w , the model L\ gives a natural model 
of ZC + GCH. Similarly, if A is a strongly inaccessible cardinal, then L\ 
is a model of ZFC + GCH. These results are closely related to the re¬ 
sults about the natural models H( A) discussed at the end of Section 2 of 
this note. Indeed, it follows by arguments similar to those used to prove 
GCH in L that if V = L, then, for A an infinite cardinal, L\ = if (A). 
The result of Godel just cited follows directly/ 

The treatment in Godel 1940 is significantly different in its details 
from that outlined above. In the first place, instead of working with the 
theory ZF as we have done, Godel works with the Bernays-Godel set 
theory BG, discussed on page 5 above. 

Second, the definition of L that is used in 1940 is a good deal more 
ad hoc. A set of “eight fundamental operations” is introduced, and an 
enumeration of sets, F: On —> V, is given which is designed so that 
(a) the range of F is closed under the eight fundamental operations, 
and (b) at many limit stages A, F(X) is the set {F(a):a < A}. Then 
L is defined to be the range of this auxiliary function F. (Roughly, the 
eight fundamental operations are mathematical operations on classes 
that correspond to basic syntactic operations on formulas. For example, 
the operation of intersection corresponds to the syntactic operation of 
taking the conjunction of two formulas.) 

Finally, the proof of the fundamental lemma needed to establish 
GCH in L is presented in a very non-conceptual way that obscures 
the connection with the Skolem-Lowenheim theorem. While the proofs 
in 1940 are presented in full detail, very little motivation is given. It 
is natural to wonder why Godel presented his results in this way in 


y We recommend that the reader interested in learning more of the details of 
Godel’s work on L begin with 1938 and 1939a. For more detailed proofs, there 
are good treatments in several modern texts, notably Kunen 1980, Jech 1978, and 
Devlin 1973. (The reader of Kunen should note that a knowledge of his Chapter 2 
on combinatorics is not needed for an understanding of his treatment of the basic 
facts about Lina later chapter.) 
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1940, when it is clear from 1939a that he was well aware of the more 
conceptual proof outlined above. My guess is that he wished to avoid 
a discussion of the technicalities involved in developing the rudiments 
of model theory within axiomatic set theory. In 1939a the portions of 
the argument that would require such a treatment are passed over in 
silence, while in 1940 an alternative treatment is developed that avoids 
the necessity for such a formalization. 

Besides the results on AC and GCH , Godel (1938) mentions two 
propositions of descriptive set theory (i.e., the study of definable sets of 
real numbers) which hold in the model L. In order to state them, we 
must review some of the standard terminology of descriptive set theory. 
Let X be one of the spaces R”, where R is the set of real numbers. 2 A 
subset Y of X is Borel if it belongs to the smallest family of subsets 
of X containing the open sets of X and closed under complementation 
and countable unions. A subset Y of R n is Sj (or analytic) if it is the 
projection of a Borel subset of R" +1 (under the map that deletes the 
last component of an (n + l)-tuple). A subset Y of X is II{ if it is the 
complement with respect to X of some Sj subset of X. This hierarchy 
of subsets is continued as follows: A subset of R" is Ej. +1 if it is the 
projection of a 11*. subset of R n+1 . A subset of R™ is Hfe+i if it is the 
complement, relative to R n , of some £|. +1 set. A subset Y of X is A*, 
if it is both ££ and njj.. Finally, Y is projective if it is £*, for some 
integer k. 

We can now state the two propositions of descriptive set theory that 
Godel showed are valid in L : 

(1) There is a A 2 subset of R that is not Lebesgue measurable. 

(2) There is a nj subset of R that has cardinality c but contains no 
perfect subset. 

These results should be contrasted with the following theorems of 
ZFC (cf. Moschovakis 1980): 

(1') Every £* or IIj subset of R is Lebesgue measurable; a fortiori, 
every Borel subset of R is Lebesgue measurable. 

(2') Every subset of R is either countable or contains a perfect 
subset of cardinality c. 

The two results of Godel just cited are consequences of the fact that, 
assuming V — L, the restriction of the canonical well-ordering of L to 
the reals gives a good well-ordering. aa Godel gave no proof of (1) 
and (2) in 1939a and a cryptic proof of a few lines, comprehensible only 


z R n is viewed as a topological space in the usual way; with slight modifications, 
the definitions given here apply to any complete separable metric space X without 
isolated points. See Moschovakis 1980, the standard reference work on modern 
descriptive set theory. 

aa The word “good” has a technical meaning here that I shall not stop to explain. 
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to cognoscenti, in 1940. (There is a fuller proof in Godel’s handwrit¬ 
ten notes for his 1940 lectures.) A detailed proof of (1) and (2) was 
first published in Novikov 1951, and a proof of the result about a good 
Aj well-ordering of R was first given in Addison 1959. For a readable 
account, see Jech 1978, pages 527-530, or Moschovakis 1980, pages 274 
281. 


4. Further work 

In this fourth and final section, I shall describe subsequent work done 
on L and on the questions raised by the work of Godel. For historical 
reasons; specific references are given wherever possible; a good pair of 
general references that cover almost all of the following are Jech 1978 
and Devlin 1973. bh 

After Godel, the first work on L was done by Kuratowski and 
Mostowski. cc Mostowski reconstructed Godel’s proof that, in L, the reals 
have a A| well-ordering. Kuratowski showed that, using the projective 
well-ordering of the reals which Godel deduced from V = L, one could 
prove that various pathological sets previously constructed using the ax¬ 
iom of choice and the continuum hypothesis would be projective. 

In the late 1950s Godel’s “inner model method” was generalized 
slightly in the work of Hajnal, Levy and Shoenfield. dd They introduced 
a relative version of the notion of constructible set. For example, if x is 
a set of ordinals then L[x], the class of sets constructible from x, can be 
characterized as the minimal transitive class containing all the ordinals 
which is a model of ZF and which has rasa member. It may be shown 
that AC holds in L[x] by the same argument as above, and even that 
GCH holds in L[x\ when x C u. ee A typical further result is the follow¬ 
ing theorem of Levy and Shoenfield: If V = L follows from GCH, then 
V = L is already a theorem of ZF. 

It had already become clear in the early 1950s, thanks to the work 
of Shepherdson (1951-1953), s that the inner model method is quite 


bb Devlin 1984 is a revised version of Devlin 1973 that contains a great deal of 
interesting additional material. For example, it discusses Silver machines and the 
simplified morasses of Velleman. 

cc Cf. Addison 1959, p. 338. Mostowski’s manuscript was destroyed during the 
Second World War. Kuratowski’s work suffered a similar fate, but was later recon¬ 
structed and published as Kuratowski 1948. 

dd Cf. Hajnal 1956, 1961, Levy 1957, 1960b, and Shoenfield 1959. 
ee By a slightly more difficult argument one can show that GCH continues to hold 
if V = L[a ] and a C Mi. 

Similar results were obtained subsequently but independently by Cohen {1963a). 
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incapable of showing that V = L is not a theorem of ZFC. The reason 
is that one cannot rule out, on the basis of ZFC, the possibility that the 
universe of sets is minimal, i.e., has no non-trivial inner models. The 
existence of minimal models can be argued as follows. Evidently, a min¬ 
imal model must satisfy V = L. If there are no transitive set-models of 
V = L, then L itself can be shown to be minimal. If there are transitive 
set-models of ZFC, then by applying the Godel construction of L within 
such a model, one can easily conclude that there are transitive models 
of ZFC of the form L$. By taking 8 as small as possible, one obtains, 
once again, a minimal model of ZFC. 

Of course, Shepherdson’s analysis shows, a fortiori, that the inner 
model method is incapable of answering the following two natural ques- 
tions: gs 

(a) Is the continuum hypothesis a theorem of ZFC? 

(b) Can the proposition “The reals have a well-ordering” be proved 
without the aid of the axiom of choice? hh 

Thus Godel’s work raised the fundamental new question: Is V = L a 
theorem of ZFC? But, through the work on relative constructibility, it 
also provided an important clue to the solution. One could prove that 
if V = L is not a theorem of ZFC, then there is a model of ZFC of 
the form L[x ] for some set x of ordinals, with x not constructible. 11 In 
addition, it seemed highly plausible that x could be taken to be a set 
of integers. This reformulation was useful because the structure of the 
model L[x\ is quite transparent. In particular, the sets of the model 
L\x\ are naturally parametrized by the ordinals. Thus it was natural to 
phrase the problem (of showing that V — L is not a theorem of ZFC) as 
follows: Let M be a countable transitive model of ZFC which has the 
form Lg ; can we then find a subset x of w such that Lg[x] is again a 
model of ZFC? Unfortunately, though it is easy to pick x so that L$\x\ 
is not a model of the Replacement Axiom of ZFC (simply choose x to 
encode the ordinal 8), there was no obvious way to ensure that Lg[x] is 
a model of ZF (and hence of ZFC). 

The questions raised above were all settled by Paul Cohen with his 
development of the technique of forcing. (Cf. Cohen 1963, 1964, and 


gs More precisely, for each of these questions the expected answer was ‘no’. The 
models that were needed to supply a negative answer could not be constructed by 
the inner model method. Models constructed by Cohen’s method eventually showed 
the expected answers to be correct. 

hh As mentioned previously, Mostowski, building on earlier work of Fraenkel, had 
shown that in the version of set theory that allows individuals as well as pure sets, 
the axiom of choice could not be proved. But this left open the possiblity that the 
axiom of choice could be proved for the sets of ordinary mathematical practice, such 
as the real numbers. 

"This follows from the work of Hajnal, Levy and Shoenfield cited in footnote dd. 
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1966.) Cohen showed (under the assumption that ZFC is consistent) 
that there are models of ZF in which 

(a) AC and GCH hold, but there is a non-constructible set of integers 
(so V = L is false); 

(b) AC holds, but CH is false; 

(c) AC fails, and in fact the reals cannot be well-ordered. 

Unlike Godel’s inner model method which, by itself, could produce 
only the single model L, the forcing method has proved to be an ex¬ 
tremely flexible and powerful tool for the creation of models in which a 
wide variety of set-theoretical propositions can be seen to be consistent 
with the axioms of set theory. 

On the other hand, the Cohen forcing method is incapable of showing 
that a proposition is independent of V = L. It does give a systematic 
method for enlarging a countable transitive model of ZFC, M, to a 
larger model, N, in which propositions may well hold that do not hold 
in M. But the notion of “constructible set” is absolute between the 
two models, and thus the method gives no information about models of 
ZFC + V = L. 

There are several natural propositions of set theory whose status in 
L had been left open by Godel. The most noteworthy was Suslin’s hy¬ 
pothesis ( SH ), first given in Suslin 1920. This asserts that the following 
four properties characterize the real line as a linearly ordered set: 

(1) It is order-dense. That is, if a and b are reals with a <b, then for 
some real c, a < c < b. 

(2) It is order-complete. That is, every set of reals which is bounded 
above has a least upper bound. 

(3) It has no least element or greatest element. 

(4) Every pairwise disjoint collection of open intervals is at most 
countable. 

Condition (4) is an easy consequence of the fact (4') that R has a 
countable order-dense subset. It is quite easy to see, as Cantor showed 
(1895), that conditions (1) through (3) together with (4') do character¬ 
ize R up to order-isomorphism, so SH amounts to the assertion that (4') 
can be weakened to (4). 

The proposition SH is an extremely natural one that turns up also in 
the theories of partially ordered sets and Boolean algebras. It was also 
a natural candidate for a new proposition (other than those considered 
by Cohen) to be proved independent of the axioms of ZFC. Indeed, 
shortly after Cohen’s technique of forcing was developed, it was shown 11 
that Suslin’s hypothesis is both consistent with and independent of the 


Jech 1967, Tennenbaum 1968, and Solovay and Tennenbaum 1971. 
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axioms of ZFC. But that left open the question whether it holds or fails 
in L. 

There is a reformulation of SH that makes sense for any regular cardi¬ 
nal re. Suslin’s hypothesis is equivalent to the non-existence of a certain 
kind of tree (dubbed a Suslin tree) on Hi, the first uncountable cardinal. 
One can then generalize the Suslin problem to the question “For which 
uncountable regular cardinals re is there a Suslin tree on re?” 

It was shown by Jensen (1972) that Suslin’s hypothesis is false in L. 
Subsequently, with much more effort, Jensen completely determined for 
which regular cardinals re there is a re-Suslin tree in L. It was evident, a 
priori, that if re is weakly compact, then no re-Suslin tree exists. Jensen 
showed that if V — L, there is a re-Suslin tree for any regular uncount¬ 
able re that is not weakly compact. 

Jensen’s proof that Suslin’s hypothesis fails in L proceeds by deducing 
from V = L a previously unconsidered combinatorial principle, which 
Jensen dubbed diamond (0) and which is a considerable strengthening 
of the continuum hypothesis. In certain circumstances, this principle al¬ 
lows one to meet H 2 requirements in the course of a construction of length 
Hi. With its aid, the construction of a Suslin tree is relatively straight¬ 
forward. This principle and its variants have subsequently had numerous 
other applications in point-set topology and algebra. One noteworthy 
example is Shelah’s proof that Whitehead’s problem is undecidable. kk In 
one direction, Shelah shows that it follows from that every W -group 
of size Hi is free. In the other direction, he deduces from Martin’s axiom 
and 2 n ° > Hi that there is a W-group of size Hi that is not free. 

We shall now give a precise statement of 0- Recall that by definition 
a subset C of Hi is club if and only if C is unbounded in Hi and C 
contains the least upper bound of each countable subset of C. 11 A subset 
S of Hi is stationary if the intersection of S with any club subset of 
Hi is non-empty." 1 " 1 The proposition <0> asserts the existence of a family 
( A a : a < Hi) with the following properties: 

(a) A a C a for all a < Hi. 

(b) Let S C Hi. Then {a : A a — S (~l a} is stationary. 

Thus 4} gives a “guessing procedure” which, for any subset S of Hi, 
correctly predicts S fl a a significant portion of the time. 

kk A W-group is an abelian group G such that F,xt(G, Z) = 0, where Z is the set 
of integers. Every free abelian group is a W-group, and every countable W-group is 
free. The version of Whitehead’s problem considered by Shelah asks if every Vf'-group 
of size Mi is free. (Shelah’s results appear in his 1974, but we strongly recommend 
the exposition of his results in Eklof 1976.) 

"The word “club” comes from the phrase “closed unbounded”. 

mm The following analogy may be useful. Club subsets of Mi correspond to sub¬ 
sets of the unit interval [0,1] having Lebesgue measure 1; stationary subsets of Hi 
correspond to subsets of [0,1] having positive Lebesgue measure. 
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While Jensen’s solution to the Suslin problem on Ni is classical in 
spirit (since the proof of <0> involves the same ideas and techniques used 
in Godel’s proof that GCH holds in L), the situation for the general¬ 
ization of Suslin’s problem to higher regular cardinals is quite different. 
Again the proof turns on showing that certain remarkable combinato¬ 
rial principles hold in L. But it required a hitherto unknown detailed 
level-by-level analysis of the constructible hierarchy. (To facilitate this 
analysis, Jensen worked with a slight variant of the L a hierarchy that 
had better closure properties.) Roughly speaking, by studying the pre¬ 
cise place where an ordinal becomes singular, Jensen was able to exploit 
to good effect the residue of regularity that still remains just before 
that level. These techniques, which Jensen dubbed “the study of the 
fine structure of L”, have had several other striking applications, no¬ 
tably to the proof of various model-theoretic two-cardinal theorems in L 
and to the proof of the Jensen covering theorem, which we shall discuss 
presently. 

Subsequent to Jensen’s work, Silver developed an alternative ap¬ 
proach (the so-called Silver machines) that yields simpler proofs of most 
of the applications of the fine-structure theory (including all those men¬ 
tioned here). Silver never published his work, but a presentation can be 
found in Devlin 1984• 

Next we consider results having to do with the notion of absoluteness 
for sentences. A sentence 4> is said to be absolute (in ZFC) if we can 
prove in ZFC that <f> holds in V if and only if (j) holds in L. nn (Until the 
work of Cohen referred to above, one could not rule out the possibility 
that every sentence <f> in the language of set theory is absolute.) It is 
easy to see that every arithmetical sentence 00 is absolute, for L's notion 
of the integers is identical to that of V. Similarly, by exploiting the 
connection between II| sentences and the concept of well-ordering and 
the fact that the same ordinals appear in V and in L, one can easily 
show that 11} sentences are absolute. 

The best possible result in this direction was obtained by Shoenfield 
(1961), who showed that Ej formulas are absolute. This easily implies 
that a II3 sentence true in V holds also in L. pp Shoenfield’s theorem 
is quite useful in descriptive set theory, since it permits one to prove 


““This notion is closely related to, but not identical with, the notion of absolute¬ 
ness introduced in Section 3. 

°°An arithmetical sentence is one that asserts that some proposition holds in 
the structure consisting of the non-negative integers equipped with the operations of 
addition and multiplication. 

pp The converse need not be true. For example, the assertion that every real is 
constructible is easily seen to be II3. In the original forcing models of Cohen, it holds 
in L but not in V. 
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the absoluteness of statements that prima facie do not appear to be ab¬ 
solute. 

Even the simpler fact of the absoluteness of arithmetical statements 
is sometimes quite useful. It has the consequence that any arithmetical 
statement provable using AC and GCH is provable without their aid. 
For example, in the work (1965) of Ax and Kochen on the first-order 
theory of p-adic fields, a principal tool is the ultraproduct construction 
of models, and the theory of ultraproducts is much smoother if GCH 
is assumed. The remark just made ensures that the arithmetical con¬ 
sequences of their investigations (for example, the decidability of the 
theory of p-adic fields) are outright theorems of ZFC. (This observation 
is credited to Kreisel.) 

We turn next to the implications of large cardinal assumptions for 
the constructible universe. This part of our subject has had an involved 
history, starting with Scott’s proof in 1961 that if there are measurable 
cardinals then V is unequal to L, and culminating in the work of Silver, 
Kunen, and the present author, to be described below. Important inter¬ 
mediate work, which we shall not describe, was done by Gaifman (1964, 
1974) an d Rowbottom (1971). 

Before going on, it is worth pausing a moment to note Godel’s own 
attitude toward large cardinals. In his 1947 he held out the hope that 
future discoveries in this area might lead to new axioms that would settle 
the continuum problem. This has not yet happened, and the large car¬ 
dinal axioms known to date are relatively consistent with both CH and 
its negation. In conversations with the author, Godel expressed belief in 
the existence of measurable cardinals (see pages 167 and 260-261 below) 
and offered the following heuristic argument in favor of their existence. 
It is known that every strongly compact cardinal is measurable. But the 
existence of strongly compact cardinals is equivalent to the statement 
that a certain property of No is also shared by some cardinal greater 
than No- Godel then expressed the belief (which I am unable to present 
in a coherent way) that reasonable properties possessed by No should 
also be satisfied by some cardinal greater than No- qq 

A measurable cardinal is, by definition, a cardinal k such that there 
is a non-trivial {0, l}-valued K-additive measure defined on the collec¬ 
tion of all subsets of n. It has been known since Tarski 1962 that a 
measurable cardinal must be very large. In particular, it is a strongly 
inaccessible cardinal, and if k is the measurable cardinal in question, 


qq The restriction to “reasonable properties” is my addition to keep the argument 
from being blatantly fallacious. (The property of being the least infinite ordinal is 
satisfied only by Ho.) I do not find this particular argument for the existence of 
measurable cardinals to be convincing. 
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then k is the «',th strongly inaccessible cardinal. It follows, by Godel’s 
incompleteness theorem, that the existence of measurable cardinals can¬ 
not be proved in ZFC, and that ZFC + “there is a measurable cardinal” 
cannot be proved consistent from the assumption of the consistency of 
ZFC. rr Nevertheless, many set theorists (including the author) believe 
that measurable cardinals exist, so that their consequences for the con- 
structible universe are true. Among these are the following: 

(a) If A is an uncountable cardinal (in V), then L\ is an elementary 
submodel of L. 

(b) Hence (taking the case A = Ki of (a)), if 7 is an ordinal definable 
in L (such as the Hr of L, or the third strongly inaccessible cardinal of 
L), then 7 is countable in V. In particular, there are only countably 
many constructible sets of integers. 

It also follows from (a) that every uncountable cardinal of V is a limit 
cardinal in L. Hence if 7 is an infinite ordinal, then there are precisely 
card( 7 ) constructible subsets of 7 . Another consequence of (a) is that 
the satisfaction relation for L is definable in F. ss 

Further consequences make use of the construction of models gener¬ 
ated by a set of indiscernibles. Such models were first considered by 
Ehrenfeucht and Mostowski (1956), and they are in many ways rather 
special. For example, there are rather few types of elements realized 
in such models, and they tend to have many elementary monomor- 
phisms into themselves. Silver (1971) realized that the techniques of 
Ehrenfeucht-Mostowski could be applied fruitfully to the study of L, on 
the assumption that there is a measurable cardinal. 

Silver showed that there is a canonical generating class C of indis¬ 
cernibles for L (now known as the class of Silver indiscernibles). They 
can be characterized as follows: 

(1) C is a closed unbounded class of ordinals. 

(2) C generates L. That is, every element of L has a first-order defi¬ 
nition in L from a finite number of parameters in C. 

(3) The members of C are indiscernible in L. That is, any two in¬ 
creasing n-tuples from C have the same first-order properties in L. In 
particular, any two members of C look completely alike in L. 

One can show that every uncountable cardinal of V is a Silver indis¬ 
cernible and that the Silver indiscernibles are very large cardinals in L. 
(They are strongly inaccessible, Mahlo, weakly compact, etc.) Indeed, 
every large cardinal property that is compatible with V = L holds, in 
L, of the Silver indiscernibles. 


rr Provided no arithmetical consequence of ZFC is false. 

ss This should be contrasted with Tarski’s theorem on the undefinabmty of truth, 
which implies that the satisfaction relation for L (or V) is not first-order definable 
in L (or V, respectively). 
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In view of (3), one can introduce the following set of integers, 0#, 
which encodes the structure of L using only countably many bits of in¬ 
formation. An integer k lies in 0# if and only if it is the Godel number 
of some formula of the language of set theory </>(vi, ...,v n ) that holds in 
L when the v^s are replaced by an increasing n-tuple from C. 

The set 0 * is interesting in its own right. Prior to the discovery of 
0 # , there was no natural example of a definable set of integers that 
is not constructible."' It was shown in Solovay 1967 that 0# is a non- 
constructible A| set of integers. uu It was also proved there that every 
constructible set of integers is recursive in 0#, and thus is A3. 

We shall have occasion in the following to refer to the proposition 
“0# exists”. One way of expressing this is to say that L has a closed 
generating class of indiscernibles. This formulation has the drawback, 
however, that it is not expressible in the usual language of set theory 
(since it involves bound class variables). One can, however, produce a 
II2 formula, <j){x), that (if a measurable cardinal exists) holds only of 
0 # . Then “0# exists” can be taken to mean (3x)(j>(x). One can show 
that all the consequences of measurable cardinals for the structure of L 
mentioned above already follow from the proposition “0# exists”. 

Kunen has proved that the proposition “0# exists” is equivalent to the 
existence of a non-trivial elementary monomorphism of L into itself. vv 
This should be compared with the following equivalent of the proposi¬ 
tion “a measurable cardinal exists”: There is a non-trivial elementary 
embedding of V into some transitive class M. (In each case, the phrase 
“non-trivial” means “not the identity map”.) 

In view of the fact that measurable cardinals are extremely large car¬ 
dinals whose existence is incompatible with the axiom of constructibility, 
it is of interest to consider the question, “Which large cardinal axioms 
are compatible with V = LT' Godel had remarked at the end of his 
1938 that 

In this connection, it is important that the consistency proof for 
A [that is, V — T] does not break down if stronger axioms of 
infinity (e.g., the existence of inaccessible numbers) are adjoined 
to T. Hence the consistency of A seems to be absolute in some 
sense, although it is not possible in the present state of affairs to 
give a precise meaning to this phrase. 


“It was known that the existence of a non-constructible ordinal-definable set of 
integers is consistent. The question was whether a definable non-constructible set of 
integers could be proved to exist in some reasonable extension of ZFC. 

uu It follows from the Shoenfield absoluteness theorem mentioned earlier that the 
definability estimate A3 is best possible. 

vv Kunen never published his proof. A different proof of Kunen’s theorem, due to 
Silver, appears in V.4 of Devlin 1984■ 
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We have already indicated that the existence of measurable cardinals 
contradicts V = L in a strong sense. At the moment, the situation 
is the following. For large cardinal properties that are not too strong, 
for example those of being strongly inaccessible, or Mahlo, or weakly 
compact, the property holds of n in L if it holds of k in the universe, 
and the existence of 0# implies that all the Silver indiscernibles have 
the property. But stronger properties imply the existence of 0 # , and 
so contradict the proposition V — L in a strong way. A description 
of the precise dividing line would involve an excursion into the subject 
of partition cardinals; we content ourselves with the remark that there 
are currently no large cardinal properties for which the status of their 
compatibility with V = L remains unknown. ww 

Another remarkable result due to Jensen, his so-called covering theo¬ 
rem, ensures (roughly speaking) that if 0# does not exist, then there are 
rather tight connections between L and V (Devlin and Jensen 1975). 

The theorem is as follows: Assume that 0# does not exist. Let X 
be a set of ordinals. Then there is a set of ordinals Y, lying in L, such 
that (1) X is a subset of Y and (2) card(F) is at most the maximum of 
card(A) and Hi. (All cardinals referred to in (2) are computed in V.) 

The theorem can be roughly paraphrased as follows. Either 0 # exists, 
whence, by the results cited previously, L is a very sparse subclass of 
V; or 0# does not exist, in which case every set of ordinals in V can be 
tightly approximated from above by a constructible set. 

The theorem has several striking consequences, of which we mention 
only the following two, each under the assumption that 0^ does not 
exist: 

(1) Let ac be a singular strong limit cardinal. Then 2 K — k + . (That 
is, GCH holds at ac.) 

(2) Let ac be a singular cardinal. Then the least cardinal greater than 
ac, as computed in L, is the same as the least cardinal greater than k as 
computed in V. 

It is not hard to see that the proposition “0# does not exist” holds in 
every forcing extension of L. Thus the Jensen covering theorem can be 
used to obtain stringent limitations on what one can accomplish merely 
by forcing, without the use of large cardinals. (For example, without 
the use of fairly large cardinals one cannot construct a model of ZFC in 
which the first k for which 2 K / ac + is a singular cardinal. xx ) 


ww The best results on this dividing line appear in Baumgartner and Galvin 1978. 
xx In Magidor 1977 a model is constructed by the forcing method in which GCH 
first fails at Magidor’s ground model for this construction contains a “huge” 

cardinal. (The first huge cardinal is far bigger than the first measurable cardinal.) 
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Our final topic is the subject of inner models for large cardinals. We 
have already mentioned that the proposition V = L contradicts the exis¬ 
tence of measurable cardinals. However, there is a natural generalization 
of L, L[D], which has properties closely analogous to L and in which 
there is a measurable cardinal. 

The model L[D] is obtained by a slight generalization of the notion of 
relative constructibility introduced earlier. Let A be a class. Then one 
can show that there is a smallest transitive model M of ZF, containing 
all the ordinals, such that (Vx e M){A fl x e M ). This M we call L[A]. 
If A is a set of ordinals, then this new notion reduces to that considered 
on page 14. However, even if A is a set, we need not have A e L[A], 

Suppose now that re is a measurable cardinal. Then there is a distin¬ 
guished class of measures on re, the normal measures, defined as follows: 

A (two-valued) measure /x is a homomorphism from the Boolean al¬ 
gebra P(k) into the two-element Boolean algebra {0,1}. The measure 
/x is non-trivial if /x(re) = 1 and the measure of every one-element subset 
of re is 0. The measure n is re -additive if the union of fewer than re sets 
of measure zero itself has measure zero. 

A function /: re —» re is regressive on a subset D C. re if for every a e D 
we have f(a) < a. Finally, \x is normal if whenever /: re —> re is regressive 
on a set of measure one, then / is constant on a (possibly smaller) set 
of measure one. 

There is a somewhat more conceptual alternative characterization of 
normal measures in terms of ultrapowers. If /x is a countably additive 
measure on re, then the ultrapower construction gives rise to an elemen¬ 
tary embedding j: V —* M. Then /x is normal if and only if re is the 
least ordinal moved by j and the identity function represents re in the 
ultrapower. 

Normal measures tend to concentrate on the large cardinals less than 
re. For example, one can show that the set of strongly inaccessible cardi¬ 
nals less than re receives measure one from every normal measure on re. 
Moreover, one can show that every measurable cardinal carries a normal 
measure. 

We can now describe the inner model for a measurable cardinal. Let 
fi be a normal measure on re and let D be the collection of sets of /x- 
measure one. It is rather easy to show that, in L[D], re is a measurable 
cardinal. It follows from results of Kunen (1970) that the model L[D] 
depends only on re, that, in L[D], re is the unique measurable cardinal, 
and that D fl L[D] is the collection of sets of measure one with respect 
to the unique normal measure on re in L[D]. 

It turns out that L[D ] very closely resembles L. For example, results 
of Silver show that (a) GCH holds in L[D] (1971a), and (b) there is 
a good A3 well-ordering of the reals in L[D] (1971b). (This should be 
compared to the result cited on page 13 that there is a good well- 
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ordering of the reals in L.) The author showed that there is a “fine 
structure theory for L[D ]” quite analogous to the usual fine structure 
theory of L. (This work was never published, but it is implicit in the 
subsequent work of Dodd and Jensen (1981) on the “core model” K.) 

It is certainly an interesting fact in its own right that there is a nat¬ 
ural inner model for ZFC + “there is a measurable cardinal” that is 
quite analogous to the natural inner model L for the theory ZFC. But 
inner models for large cardinals also have important applications to the 
problem of establishing lower bounds on the consistency strength of 
propositions. For example, it is a theorem of Mitchell (198?) that if 
there is a model of ZFC in which GCH first fails at then there is a 
model of ZFC in which there is a measurable cardinal of high order. An 
essential ingredient in his proof is the construction of inner models for 
measurable cardinals of high order. 

It is therefore an important problem to find L-like models in which 
there are various large cardinals. Considerable progress has been made 
on this problem by Mitchell (1974, 1979), Dodd and Baldwin. However, 
recent results of Woodin show that certain large cardinals, if they have 
inner models at all, only have ones that behave very differently from the 
inner models discovered to date. 

In order to state these results, we shall recall the definitions of some 
large cardinals. yy First, let j be an elementary embedding of V into a 
transitive class M. If j is not the identity, then one can show that j 
moves some ordinal. The critical point of j is the least ordinal moved by 
j. The critical point of a non-trivial elementary embedding j: V —> M 
is always a measurable cardinal. The stronger the closure conditions 
imposed on M, the stronger the corresponding large cardinal property. 
(We remark that Kunen (1971) has shown that there is no non-trivial 
elementary embedding of V into itself.) Let k and A be infinite cardi¬ 
nals. Then k is X-strong if there is an elementary embedding j: V —» M 
with critical point k such that j(n) > A and R( A) C M. The cardinal 
k is strong if it is A -strong for every A > k. k is superstrong if there 
is an elementary embedding j: V —> M with critical point k such that 
R(j(k)) Q M. Finally, k is X-supercompact if there is an elementary 
embedding j: V —> M with critical point k such that M is closed under 
sequences of length A. 


yy Two good references on the subject of large cardinals are Kanamori, Reinhardt 
and Solovay 1978 and Kanamori and Magidor 1978. 
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As far as consistency strength goes, these concepts are related as fol¬ 
lows: If there is a k which is 2 K -supercompact, then there is a transitive 
model of ZFC with a proper class of superstrong cardinals; if there is 
a superstrong cardinal, then there is a transitive model of ZFC with a 
proper class of strong cardinals. Building on earlier work of Mitchell 
(1979), Dodd (198?) constructed L-like inner models with a proper 
class of strong cardinals. In these models, the reals have a good A3 
well-ordering. 

It was generally felt that the work of Mitchell and Dodd would eventu¬ 
ally lead to inner models for supercompact cardinals. However, in 1984 
Woodin proved the following remarkable theorem: If there is a super¬ 
strong cardinal, then there is no projective well-ordering of the reals. zz 
But, in all the inner models constructed by Mitchell, Dodd, and Bald¬ 
win, there is in fact a A3 well-ordering of the reals. Thus inner models 
for cardinals at least as large as superstrong must in some ways be very 
different from L. 

Woodin’s theorem raises many questions. Here are two: 

(1) What is the precise dividing line between cardinals which are com¬ 
patible with a A3 well-ordering of the reals and cardinals which are not 
so compatible? 

(2) Suppose that k is K + -supercompact. Is there a transitive class- 
model of ZFC, containing all the ordinals, in which k remains k + - 
supercompact and in which GCH holds? 

We remark that Woodin has constructed models where the first mea¬ 
surable cardinal k is K + -supercompact. However, in a model of ZFC + 
GCH the first n which is K + -supercompact has k measurable cardinals 
below it. Thus an affirmative answer to (2) would seem to require some 
sort of inner model construction for supercompact cardinals of a sort 
not ruled out by the “anti-inner-model” theorem of Woodin just cited. 

To sum up: almost all of the natural questions raised by Godel’s work 
on L have by now been settled. However, the topic of L -like models for 
large cardinals is still rife with mystery, though some important progress 
has been made. 


Robert M. Solovay 


zz Personal communication to the author. Woodin’s work relies ii 
way on earlier recent work of Foreman, Magidor and Shelah (198?). 



The consistency of the axiom of choice 
and of the generalized continuum hypothesis 
( 1938 ) 


Theorem. Let T be the system of axioms for set theory obtained from 
von Neumann’s system 1 S* by leaving out the axiom of choice (i.e., replacing 
Axiom III 3* by Axiom III 3); then, ifT is consistent, it remains so if the 
following propositions 14 are adjoined simultaneously as new axioms: 

1. The axiom of choice (i.e., von Neumann’s Axiom III 3*). 

2. The generalized continuum hypothesis (i.e., the statement that = 
N q+1 holds for any ordinal a). 

3. The existence of linear non-measurable sets such that both they and 
their complements are one-to-one projections of two-dimensional comple¬ 
ments of analytic sets (and which therefore are B 2 -sets in Lusin’s terminol¬ 
ogy 2 )- 

4. The existence of linear complements of analytic sets, which are of the 
power of the continuum and contain no perfect subset. 

A corresponding theorem holds if T denotes the system of Principia 
mathematical or Fraenkel’s system of axioms for set theory, 4 leaving out 
in both cases the axiom of choice but including the axiom of infinity. 

The proof of the above theorems is constructive in the sense that, if a 
contradiction were obtained in the enlarged system, a contradiction in T 
could actually be exhibited. 

The method of proof consists in constructing on the basis of the axioms 5 
of T a model for which the propositions 1-4 are true. This model, roughly 
speaking, consists of all “mathematically constructible” sets, where the 
term “constructible” is to be understood in the semi-intuitionistic sense 
which excludes impredicative procedures. This means “constructible” sets 
are defined to be those sets which can be obtained by Russell’s ramified 
hierarchy of types, if extended to include transfinite orders. The exten¬ 
sion to transfinite orders has the consequence that the model satisfies the 
impredicative axioms of set theory, because an axiom of reducibility can 


J Cf. von Neumann 1929. 

2 Cf. Luzin 1930, p. 270. 

3 Cf. Tarski 1933. 

4 Cf. Fraenkel 1925. 

5 This means that the model is constructed by essentially transfinite methods and 
hence gives only a relative proof of consistency, requiring the consistency of T as a 
hypothesis. 
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be proved for sufficiently high orders. Furthermore the proposition “Ev¬ 
ery set is constructible” (which I abbreviate by “A ”) can be proved to be 
consistent with the axioms of T, because A turns out to be true for the 
model consisting of the constructible sets. From A the propositions 1-4 
can be deduced. In particular, proposition 2 follows from the fact that 
all constructible sets of integers are obtained already for orders < uq, all 
constructible sets of sets of integers for orders < w 2 and so on. 

| The proposition A added as a new axiom seems to give a natural com- 557 
pletion of the axioms of set theory, in so far as it determines the vague 
notion of an arbitrary infinite set in a definite way. In this connection it is 
important that the consistency proof for A does not break down if stronger 
axioms of infinity (e.g., the existence of inaccessible numbers) are adjoined 
to T. Hence the consistency of A seems to be absolute in some sense, 
although it is not possible in the present state of affairs to give a precise 
meaning to this phrase. 


The consistency of 

the generalized continuum hypothesis 
( 1939 ) 


We use the following definitions: 1. M 0 = A; 2. M a+1 is the set of those 
subsets of M a which can be defined by propositional functions containing 
only the following concepts: ~, V, the e-relation, elements of M a , and quan¬ 
tifiers for variables with range M a ; 3. Mp = Y. aC pM a for limit numbers 
/3. Then M Wu or Mq (f2 being the first inaccessible number) is a model 
for the system of axioms of set theory (as formulated by A. Fraenkel, J. 
von Neumann, T. Skolem, P. Bernays) respectively without (or with) the 
axiom of substitution, the generalized continuum hypothesis (2 Nq = K a+ i) 
being true in both models. Since the construction of the models can be 
formalized in the respective systems of set theory themselves, it follows 
that 2 n “ = K a+ i is consistent with the axioms of set theory, if these ax¬ 
ioms are consistent with themselves. The proof is based on the following 
lemma. Any subset of M Wa which is an element of some Mp is an element 
of M Ua+1 . This lemma is proved by a generalization of Skolem’s method 
for constructing enumerable models. Since the axiom of choice is not used 
in the construction of the models, but holds in the models, the consistency 
of the axiom of choice is obtained as an incidental result. 



Consistency proof for 
the generalized continuum hypothesis 1 
(1939a) 


If M is an arbitrary domain of things in which a binary relation e is 
defined, call ''''propositional function over Af” any expression <j> containing 
(besides brackets) only the following symbols: 1. Variables x,y,... whose 
range is M. 2. Symbols ai,...,a n denoting 2 individual elements of M 
(referred to in the sequel as “the constants of <£”). 3. e. 4. ~ (not), V 

(or). 5. Quantifiers for the above variables x,y, - 2a Denote by M 1 the 

set of all subsets of M defined by propositional functions <j>(x) over M. 
Call a function / with s variables a “function in M” if for any elements 
221 xi, ..., x 8 of M | f(x i,..., x s ) is defined and is an element of M. If <f>(x) 
is a propositional function over M with the following normal form: 

(ri,..., x n )(3y lt y m )(zi,z k ){ 3«i,..., u e ) ... 

L(x,X!,... ,x n ,y!,... ,y m ,Zi,...,z k ,ui,... ,u e ,...) 

(.L containing no more quantifiers) and if a e M, then call “Skolem functions 
for <p and a” any functions /i,..., f m , gi ,..., y e , ■.. in M, with respectively 
n,...,n,n+k,.. .,n+k,... variables, such that for any elements xi ,..., x n , 
zi, ..., z k ,... of M the following is true: 

L(a,x 1 ,...,x n ,f 1 (x 1 ,...,x n ),...,f m (x 1 ,...,x n ),zi,...,z k , 

9 i(x i, ...,x n ,Z!,..., z k ),..., g e (x i, ...,x n ,z lt ..., z k ),...). 

The proposition <j>(a) is then equivalent with the existence of Skolem func¬ 
tions for (p and a. 

Now define: M 0 — {A}, M a+1 = M' a , Mp — T a< 0 M a for limit 
numbers j3. Call a set x “constructible” if there exists an ordinal a such 
that x e M a and “constructible of order a” if x e M a+ i — M a . It follows 
immediately that M a C Mp and M a e Mp for a < (3 and that: 


1 This paper gives a sketch of the consistency proof for propositions 1, 2 of Godel 1938 
if T is Zermelo’s system of axioms for set theory (1908) with or without axiom of substi¬ 
tution and if Zermelo’s notion of “definite Eigenschaft” is identified with “propositional 
function over the system of all sets”. Cf. the first definition of this paper. 

2 It is assumed that for any element of Af a symbol denoting it can be introduced. 

2a Unless explicitly stated otherwise, “propositional function” always means “proposi¬ 
tional function with one free variable”. 
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Theorem 1. x e y implies that the order of x is smaller than the order 
of y for any constructible sets x, y. 

It is easy to define a well-ordering of all constructible sets and to as¬ 
sociate with each constructible set (of an arbitrary order a) a uniquely 
determined propositional function 4 >a( x ) over M a as its “ definition ” and 
furthermore to associate with each pair (j> a , a (consisting of a propositional 
function cj> a over M a and an element a of M a for which (j> a {a) is true) 
uniquely determined “ designated Skolem functions for <f> a , a”. 3 

Theorem 2. Any constructible subset m of M W(i has an order < 

(i.e., a constructible set, all of whose elements have orders < ui^ has an 
order < 

Proof: Define a set K of constructible sets, a set O of ordinals and a set 
F of Skolem functions by the following postulates I-VII: 

I. M Ult C K and me K. 

II. If x e K, the order of x belongs to O. 

III. If x e K, all constants occurring in the definition of x belong to K. 

IV. If a e O and (/>„(#) is a propositional function over M a all of whose 
constants belong to K, then: 

1. The subset of M a defined by <f> a belongs to K. 

2. For any y e K ■ M a the designated Skolem functions for 4> a and 
y or ~(f> a and y (according as <j> a (y) or ~<f>a(y)) belong to F. 

V. If / e F, xi,... ,x n e K and (x\,.. .,x n ) belongs to the domain of 
definition of /, then f(x i,... ,x„) e K. 

VI. If x,y e K and x — y ± A the first 4 element of x — y belongs to K. 

VII. No proper subsets of K, O , F satisfy I-VI. 

| Theorem 3. If x ^ y and x,y e K-M a+1 , then there exists a z e K-M a 222 
such that z e x — y or z e y — x. 5 
(This follows from VI and Theorem 1.) 

Theorem 4. 6 K + O + F = 

since M U(l = and K + O + F is obtained from + {m} by forming 
the closure with respect to the operations expressed by II-VI. 

Now denote by r\ the order type of O and by a the ordinal corresponding 
to a in the similar mapping of O on the set of ordinals < t]. Then we have: 

Theorem 5. There exists a one-to-one mapping x' of K on M v such that 
x e y = x't y' for x,y e K and x' = x for x e . 

Proof: The mapping x' (which will carry over the elements of order 
a of K exactly into all constructible sets of order a for any a e O) is 


3 At first, with each <j> a an equivalent normal form of the above type has to be 
associated, which can easily be done. 

4 In the well-ordering of the constructible sets. 

s Theorems 3, 4, 5, are lemmas for the proof of Theorem 2. 

8 m means “power of m”. 
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defined by transfinite induction on the order, i.e., we assume that for some 
a eO an isomorphic 7 mapping / of K • M a on M„ 8 has been defined and 
prove that it can be extended to an isomorphic mapping g of K ■ M a +i 
on Ma+i 9 in the following way: At first those propositional functions over 
M a whose constants belong to K (hence to K ■ M a ) can be mapped in a 
one-to-one manner on all propositional functions over A% by associating 
with a propositional function <j) a over M a having the constants oi,..., a n 
the propositional function over M„ obtained from <j) a by replacing a* 
by aj and the quantifiers with the range M„ by quantifiers with the range 
Ma- Then we have: 

Theorem 6. < j> a (x) = (paix 1 ) for any x e K ■ M a . 

Proof: If </> a (x) is true, the designated Skolem functions for 4> a and x 
exist, belong to F (by IV, 2) and are functions in K-M a (by V). Hence they 
are carried over by the mapping / into functions in Ma which are Skolem 
functions for (/>&, x 1 , because the mapping / is isomorphic with respect to 
e. Hence <f> a (x ) D (fsix 1 )- 

~<l>a(x) D ~<fe(x 1 ) is proved in the same way. 

Now any 4> a over M a whose constants belong to K defines an element 
of K ■ M a + i by IV, 1, and any element b of K ■ M a +i can be defined by 
such a <t> a (if b e M a + i - M a , this follows by III; if b e M a , then “x e b” is 
such a <t> a )- Hence the above mapping of the <f> a on the <j>a gives a mapping 
g of all elements of K ■ M a + 1 on all elements of M&+i with the following 
properties: 

A. g is single-valued, because, if <f> a , ip a define the same set, we have 
<j> a {x) = ipa(x) for x t M a • K, hence ^(x 1 ) = ^(x 1 ) by Theorem 6, i.e., 
(j>a and ipa also define the same set. 

B. x e y = x 1 e g(y) for x e K • M a , y e K • M a + 1 (by Theorem 6). 

C. g is one-to-one, because if x, y e K • M a +i,x ± y, then by Theorem 3 
there is a ze ( x-y)+(y-x ), z e K-M a , hence z 1 e [g(x)-g(y)]+\g(y)-g(x)] 
by B. Hence g(x) ± g\y). 

D. g is an extension of the mapping f, i.e., g(x) = x 1 for x e K ■ M a . 

223 | Proof: For any b e K ■ M a a corresponding <j) a which defines it is x e b, 

hence </>a is x eb 1 , hence g(b) = b 1 . 

E. g maps K -M a exactly on Ms (by D), 10 and therefore K(M a +i — M a ) 
on Ms+i -Ms by C. 

F. g is isomorphic for e, i.e., g(x) e g(y) = x ey for any x,y e K-M a + 1- 


7 I.e., ie y = J{x) e /( y). In the following proof fix) is abbreviated by x 1 . 

8 I.e., of the elements of order < a of if on the elements of order < 71 of M v . 

9 I.e., of the elements of order < a of if on the elements of order < a of M v - 

10 Because / maps K ■ M a on 11% by inductive assumption. 
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Proof: If x e K ■ M a , this follows from B and D; if x e K ■ (M a+ 1 — M a ), 
then g(x) e M„+i — M„ by E, hence both sides of the equivalence are false 
by Theorem 1. 

By D and F, g is the desired extension of f and hence the existence 
of an isomorphic mapping x' of K on M n follows by complete induction. 
Furthermore, since all ordinals < cu M belong to O (by I, II) we have 0 = 0 
for 0 < Wfj, from which it follows easily that x = x' for x c M W(t . This 
finishes the proof of Theorem 5. 

Now, in order to prove Theorem 2, consider the set m' corresponding to 
m in the isomorphic mapping of K on M v . Its order is < r) < w^+i, 
because m! e M v and rj = O < by Theorem 4. Since x e m = 
x 1 e ml for x e K , we have x e m = x e ml for x e by Theorem 
5. Since furthermore to C M W[i , it follows that to = to' • M U(I , i.e., m is an 
intersection of two sets of order < w M +i, which implies trivially that it has 
an order < u^+i. 

Theorem 7. M Uul considered as a model for set theory satisfies all axioms 
of Zermelo 11 except perhaps the axiom of choice, and Mq (fl being the 
first inaccessible number) satisfies in addition the axiom of substitution, 
if in both cases “definite Eigenschaft”, respectively “definite Relation”, is 
identified with “propositional function over the class of all sets” (with one, 
respectively two, free variables). 

Sketch of proof for M Uu : Axioms I, II are trivial, Axiom VII is satisfied 
by Z = Mu, Axioms III-V have the form (3a:)(u)[u e x = <f>(u )], where 
the (j> are certain propositional functions over M Wui . Hence, by definition 
of M a+ 1, there exist sets x in M Uu+ 1 satisfying the axioms. But from 
Theorem 1 and Theorem 2 it follows easily that the order of x is smaller 
than lj u for the particular <j> under consideration, so that there exist sets x 
in the model satisfying the axioms. 

For Mq Axioms I-V and VII are proved in exactly the same way, and the 
axiom of substitution is proved by the same method as Axioms III-V. Now 
denote by “A” the proposition “There exist no non-constructible sets”, 12 
by “R” the axiom of choice and by “C” the proposition “2 N “ = N q+ i for 
any ordinal a”. Then we have: 

Theorem 8. A D R and Ad C. 

Ad R follows because for the constructible sets a well-ordering can be 
defined, and Ad C holds by Theorem 2, because M Wa = N Q . 

Now the notion of “constructible set” can be defined and its theory 
developed in the formal systems of set theory themselves. In particular 


11 Cf. Zermelo 1908. 

12 In order to give A an intuitive meaning, one has to understand by “sets” all objects 
obtained by building up the simplified hierarchy of types on an empty set of individuals 
(including types of arbitrary transfinite orders). 
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Theorem 2 and, therefore, Theorem 8 can be proved from the axioms of 
set theory. Denote the notion of “constructible set” relativized for a model 
M of set theory (i.e., defined in terms of the e-relation of the model) by 
constructibleM', then we have: 

224 | Theorem 9. Any element of M Ua ( respectively, Mn) is construct.ible M „ w 

( respectively, constructibleM n )', in other words: A is true in the models M Uul 
and Mq. 

The proof is based on the following two facts: 1. The operation M ' 
(defined on p. 220) is absolute in the sense that the operation relativized 
for the model applied to an x e , gives the same result as the 

original operation (similarly for Mq). 2. The set N a which has as elements 
all the Mp (for f3 < a) is constructibleM^ for a < u u and construct,ibleM n 
for a < S7, as is easily seen by an induction on a. From Theorem 9 and 
the provability (from the axioms of set theory) of Theorem 8 there follows: 

Theorem 10. R and C are true for the models M Uuj and Mq. 

The construction of and Mq. and the proof for Theorem 7 and The¬ 
orem 9 (therefore also for Theorem 10) can (after certain slight modifica¬ 
tions) 13 be accomplished in the respective formal systems of set theory 
(without the axiom of choice), so that a contradiction derived from C, R, 
A and the other axioms would lead to a contradiction in set theory without 
C, R, A. 


13 In particular for the system without the axiom of substitution we have to consider 
instead of M Wu , an isomorphic image of it (with some other relation R instead of the 
e-relation), because contains sets of infinite type, whose existence cannot be proved 
without the axiom of substitution. The same device is needed for proving the consistency 
of propositions 3, 4 of the paper quoted in footnote 1. 



The consistency of the axiom of choice 
and of the generalized continuum hypothesis 
with the axioms of set theory 
( 1940 ) 


Introduction 

In these lectures it will be proved that the axiom of choice and Cantor’s 
generalized continuum hypothesis (i.e., the proposition that 2 Na = H q+ i for 
any a) are consistent with the other axioms of set theory if these axioms are 
consistent. The system E of axioms for set theory which we adopt includes 
the axiom of substitution (cf. Fraenkel 1927, page 115) and the axiom of 
“Fundierung” (cf. Zermelo 1930, page 31) but of course does not include 
the axiom of choice. It is essentially due to P. Bernays (cf. Bernays 1937 ) 
and is equivalent with von Neumann’s system S* + VI (cf. 1929), if the 
axiom of choice is left out, or, to be more exact, if Axiom III3* is replaced 
by Axiom III3. What we shall prove is that, if a contradiction from the 
axiom of choice and the generalized continuum hypothesis were derived in 
E, it could be transformed into a contradiction obtained from the axioms 
of E alone. This result is obtained by constructing within E (i.e., using 
only the primitive terms and axioms of E) a model A for set theory with 
the following properties: 

1) the propositions which say that the axioms of E hold for A are theo¬ 
rems demonstrable in E, 

2) the propositions which say that the axiom of choice and the general¬ 
ized continuum hypothesis hold in A are likewise demonstrable in E. 

In fact there is a much stronger proposition 1 which can be proved to hold 


1 [Note added in 1951: In particular, this stronger proposition implies that there 
exists a projective well-ordering of the real numbers (to be more exact, one whose cor¬ 
responding set of pairs is a PCA-set in the plane). This follows by considering those 
pairs of relations s,e between integers which, for some -y < uji are isomorphic with 
the pair of relations <, a/3 ( F‘a e F‘0) confined to 7. The class M of these pairs s, 
e can also be defined directly (i.e., without reference to the previously defined F) by 
requiring that (1) s is to be a well-ordering relation for the integers, and (2) e, with 
respect to the well-ordering s, satisfies certain recursive postulates, which are the exact 
analogues of the postulates by which F is defined (cf. Dfn 9.3). The definition of M, in 
this form, contains quantifiers only for integers and sets of integers (i.e., real numbers) 
which ensures the projective character of the object defined and makes it possible to 
determine its projective order by counting the “changes of sign” of the quantifiers for 
real numbers occurring. In terms of M a projective well-ordering of the real numbers 
(of the order mentioned) can then be defined. As to consequences of this state of affairs, 
cf. Kuratowski 1948 .] 
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in A and which has other interesting consequences besides the axiom of 
choice and the generalized continuum hypothesis (cf. page 47). 

In order to define A and to prove the above properties of it from the 
axioms of E, it is necessary first to develop abstract set theory to a certain 
extent from the axioms of E. This is done in Chapters II IV. Although 
the definitions and theorems are mostly stated in logistic symbols, the 
theory developed is not to be considered as a formal system but as an 
axiomatic theory in which the meaning and the properties of the logical 
symbols are presupposed to be known. However, to everyone familiar with 
mathematical logic it will be clear that the proofs could be formalized, 
using only the rules of Hilbert’s “engerer Funktionenkalkul”. In several 
places (in particular for the “general existence theorem” on page 8 and 
the notions of “relativization” and of “absoluteness” on page 42) we are 
concerned with metamathematical considerations about the notions and 
propositions of the system E. However, the only purpose of these general 
metamathematical considerations is to show how the proofs for theorems 
2 of a certain kind can be accomplished by | a general method. And, since 
applications to only a finite number of instances are necessary for proving 
the properties 1) and 2) of the model A, the general metamathematical 
considerations could be left out entirely, if one took the trouble to carry 
out the proofs separately for any instance. 2 

In the first introductory part about set theory in general (i.e., in Chap¬ 
ters II-IV) not all proofs are carried out in detail, since many of them 
can be literally transferred from non-axiomatic set theory and, moreover, 
an axiomatic treatment on a very similar basis has been given by J. von 
Neumann {1928a). 

For the logical notions we use the following symbols: (A), (3A), ~, ., 
V, D, =, =, (E\X), which mean respectively: for all X, there is an X, 
not, and, or, implies, equivalence, identity, there is exactly one X. X = Y 
means that X and Y are the same object. “For all A” is also expressed by 
free variables in definitions and theorems. 

The system E has in addition to the e-relation two primitive notions, 
namely “class” and “set”. Classes are what appear in Zermelo’s formula¬ 
tion {1908, page 263) as “definite Eigenschaften”. However, in the system 
E (unlike Zermelo’s) it is stated explicitly by a special group of axioms 
(group B on page 5) how definite Eigenschaften are to be constructed. 
Classes represent at the same time relations between sets, namely a class 
A represents the relation which subsists between x and y if the ordered 
pair { x , y) (defined in 1.12) is an element of A. The same e-relation is used 


2 In particular also the complete inductions used in the proofs of Theorems 1.16, Ml, 
M2 are needed only up to a certain definite integer, say 20. 
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between sets and sets, and sets and classes. The axiom of extensionality 
(Fraenkel’s Bestimmtheitsaxiom) is assumed for both sets and classes, and 
a class for which there exists a set having the same elements is identified 
with this set, so that every set is a class. 3 On the other hand a class B 
which is not a set (e.g., the universal class) can never occur as an element 
owing to Axiom A2, i.e., B e X is then always false (but meaningful). 


| Chapter I 

The axioms of abstract set theory 

Our primitive notions are: class , denoted by <tls; set, denoted by 9J1; and 
the diadic relation e between class and class, class and set, set and class, 
or set and set. The primitive notions appear in context as follows: 

<£ls(A), A is a class, 
fD f l(A), A is a set, 

XeY, Xey, xeY, xey, 

where the convention is made that X, Y, Z,. .. are variables whose range 
consists of all the classes, and that x, y, z, ... are variables whose range is 
all sets. 

The axioms fall into four groups, A, B, C, D. 

Group A. 

1. <£ls(x) 

2. X eY .D.m(X) 

3. (u)[t t€ X. = .'uey].D.X = y 

4. (x)(y)(3z)(u)[u ez. = :u = x.\/.u = y] 

Axiom 1 in the group above states that every set is a class. A class which 
is not a set is called a proper class, i.e., 

1. Dfn q3r(X) = ~9K(X). 

Axiom 2 says that every class which is a member of some class is a set. 
Axiom 3 is the principle of extensionality, that is, two classes are the same 


3 Similarly, von Neumann 1928a. 

Note. Dots are also used to replace brackets in the well-known manner. 
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if their elements are the same. Axiom 4 provides for the existence of the set 
whose members are just x and y, for any sets x and y. Moreover, this set 
is defined uniquely for given x and y, by Axiom 3. The element z defined 
by 4 is called the non-ordered pair of x and y , denoted by {x, y}, i.e., 

1.1 Dfn u e {x, y} = (u = x V u — y). 

1.11 Dfn {x} = {x,x}. 

{x} is the set whose sole member is x. 

4 | 1.12 Dfn {x,y) = {{x},{x,y}}. 

(x, y) is called the ordered pair of x and y. We have the following theorem: 

1.13 (x,y) — (u,v) ,D:x = u.y = v, 

that is, two ordered pairs are equal if and only if the corresponding elements 
of each are equal. In this sense, (x, y) is an ordered pair. The proof of this 
theorem is not difficult (cf. Bemays 1937, page 69). 

The ordered triple may now be defined in terms of the ordered pair. 

1.14 Dfn (x,y,z) = (x,{y,z)). 

The corresponding theorem holds for the ordered triple. The n- tuple can 
be defined by induction as follows: 

1.15 Dfn (xi,x 2 ,... ,x n ) = (x l5 (x2,... ,x n )). 

This gives the theorem 

1.16 (xi,...,x n ,(x n+ i,...,x n+p )) = (xi,...,x n ,x n+ i,...,x„ +p ), 
which is proved by induction on n. 

In order that ( ) be defined for any number of arguments it is convenient 
to put 

1.17 Dfn (x) = x, 

which entails the equation 1.16 also for the case p = 1. 

We also define inclusion C and proper inclusion C. 


1.2 Dfn X CY =.(a)[«£l.D.«f]; 
X CY.ee: X C Y.X / Y. 
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A class is said to be empty if it has no members; “X is empty” is denoted 
by “(&n(.A)”, i.e., 

1.22 Dfn <&n(X) = (u)~u e X. 

If X and Y have no members in common, we write “<£f(X, Y)”, that is, “X 
and Y are mutually exclusive", i.e., 

1.23 Dfn <*r (X,Y) = (u)~(u e X .u eY). 

X is said to be one-many ( single-valued ), denoted by “iln(A)”, if for any 
u there exists at most one v such that {v, u) e X, that is: 

| 1.3 Dfn iln(X) = (u, u,iy)[(u,u) e X. (w,u) e X :D.v = to]. 5 

The axioms of the second group are concerned with the existence of classes: 


Group B. 

1. (3A)(x,y)[(x,y) e A. = .x e y] 

2. (A)(B)(3C)(u)[ueC. = :ueA.ueB] 

3. (A)(3B)(u)[uefl. = .~(ueA)] 

4. (A)(3B)(x)[x e B . = . (3y)({y, x) e A)] 

5. (A)(3B)(x,y)[{y,x) e B. = .x e A] 

6. (A.)(3B)(x,y)[(x,y) e B . = . (y,x) e A] 

7. ( A)(3B)(x,y,z)[(x,y,z ) e B . = . (y, z,x) e A] 

8. (A)(3B)(x, y, z)[{x , y,z) e B . = .(x, z, y) e A] 

Axiom B1 is called axiom of the e-relation, B2 axiom of intersection, B3 
axiom of the complement, B4 axiom of the domain, B5 axiom of the direct 
product (because it provides essentially for the existence of V x A, V being 
the universal class), B6-8 axioms of inversion. 4 Note that the class A in 
Axiom B1 and the class B in Axioms B5-8 are not uniquely determined, 
since nothing is said about those sets which are not pairs (triples), whether 
or not they belong to A (B). On the other hand in Axioms B2 4 the classes 
C and B are uniquely determined (owing to Axiom A3). These uniquely 
determined classes in B2 4 are denoted respectively by A B, -A, 33(A) and 
called intersection of A, B, complement of A, domain of A, respectively. 
Thus A • B, —A, 33(A) are defined by the following properties. 


4 Note that Axioms B7 and B8 have as consequences similar theorems for any per¬ 
mutation of a triple. 
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1.4 Dfn x e A- B = x e A.x e B 

1.41 Dfn x e —A = ~x e A 

1.5 Dfn x e T>(A) = (3y)(y, x) e A 

The third group of axioms is concerned with the existence of sets. 
Group C. 

1. (3a){~<£m(a). ( x)[x e a. D. (3 y)[y ea.xC!(]]} 

2. (x)(3y)(u,v)[u e v .v e x :D.u e y] 

3. (x)(3y)(u)[uCx.D.uey] 

4. ( x , A){iln(A). D . (3y)(u)[u ey . = . (3u)[u e x. (u, v) e A]]} 

Axiom 1 is the so-called axiom of infinity. There is a non-void set a 
such that, given any element x of a, there is another element y of a, of 
which a; is a proper subset. According to Axiom 2, for any set x there is 
a set y including the sum of all elements of x. Axiom 3 provides for the 
6 | existence of a set including the set of all subsets of x. Axiom 4 is the 

axiom of substitution-, 5 for any set x and any single-valued A, there is a set 
y whose elements are just those sets which bear the relation defined by A 
to members of x. (Instead of C4, Zermelo used the Aussonderungsaxiom: 

(x, A)(3y)(u)[u e y. = : u e x.u e A], 

that is, there is a set whose members are just those elements of x which 
have the property A.) 

The following axiom (proved consistent by J. von Neumann (1929)) is 
not indispensable, but it simplifies considerably the later work: 

Axiom D. ~<£m(A). D. (3u)[u e A. <£$(u, A)\ 

that is, any non-void class A has some element with no members in common 
with A. 6 It is a consequence of D that 

1.6 ~x e x, 


5 [Note added in 1951 : The term now in use for Axiom C4 is “axiom of replacement”.] 

6 This axiom is equivalent to the non-existence of infinite descending sequences of 

sets 7 (i.e., such that a;< + i e x t ) where however the term “sequence” refers only to 
sequences representable by sets of the system under consideration. That is (using the 
definitions 4.65, 7.4, 8.41 below) Axiom D is (owing to the axioms of the groups A, B, 
C, E) equivalent to the proposition + 1) e y‘t\. 

7 [Note added in 1951 : In this form Axiom D, under the name of “Fundierungsaxiom”, 
was first formulated by E. Zermelo ( 1930 ).] 
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for, if there were such an x, x would be a common element of x and {a;}, 
but, by D, taking {a;} for A, x can have no element in common with {x}. 
Likewise 

1.7 -[xey.yex]. 

This follows by considering {x, y} in an analogous way. 

The following axiom is the axiom of choice. 8 

Axiom E. (3A){Hn(A). (x)[~<&n(x). D. (3 y)[y e x.{y,x) e A]]} 

This is a very strong form of the axiom of choice, since it provides for the 
simultaneous choice, by a single relation, of an element from each set of the 
universe under consideration. From this form of the axiom, one can prove 
that the whole universe of sets can be well-ordered. This stronger form of 
the axiom, if consistent with the other axioms, implies, of course, that a 
weaker form is also consistent. 

| The system of axioms of groups A, B, C, D is called E. 9 If a theorem 7 
is stated without further specification it means that it follows from E. If 
Axiom E is needed for a theorem or a definition, its number is marked by *. 


| Chapter II 8 

Existence of classes and sets 

We now define the metamathematical notion of a primitive propositional 
function (abbreviated ppf). A ppf will be a meaningful formula containing 
only variables, symbols for special classes A %,..., A&, e, and logical opera¬ 
tors, and such that all bound variables are set variables. For example, 

(u)(u e X .D.u e A) and (u)[u e x . = .(v)[v e u.D.v e y]) 
are ppf. A formula is non-primitive if (X) or (3X) occurs. 


8 [lVote added in 1951: Using Dfn 4.65, the axiom of choice can be expressed in the 
following form, equivalent with Axiom E: There exist classes A for which x e y D A‘y e y.] 

9 The most important differences between S and the system of P. Bernays {1937) 

1. Bernays does not identify sets and classes having the same extension. 

2. Bernays assumes a further axiom requiring the existence of the class of all {x}, 
which allows B7 and B8 to be replaced by one axiom. 

Axiom D is essentially due to von Neumann {1929, page 231, Axiom VI 4), whose 
formulation however is more complicated, because his system has other primitive terms. 
The concise formulation used in the text is due to P. Bernays. 
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More precisely, ppf can be defined recursively as follows: Let II, F,..., 
denote variables or special classes, then: 

(1) Ilerisappf. 

(2) If <j) and ?/; are ppf, then so are ~<j> and 4> • 

(3) If is a ppf, then (3 x)(j> is a ppf, and any result of replacing x by 
another set variable is a ppf. 

(4) Only formulas obtained by 1, 2, 3 are ppf. 

Logical operators different from ~, ., 3, need not be mentioned since they 
can be defined in terms of these three. 

The following metatheorem says that the extension of any ppf is repre¬ 
sented by a class: 

Ml. General existence theorem: If ... ,x n ) is a ppf containing no 
free variables other than x\ ,..., x n (not necessarily all these), then there 
exists a class A such that, for any sets xi, ■ .. ,x n , 

(xi ,... ,x n ) e A. = .<j>(x 1,.. .,x n ). 

For the proof of this theorem, several preliminary results are needed. 

By means of the axioms on intersection and complement, it is possible 
to prove the existence of a universal class V and a null class 0. Because 
of the axiom of extensionality, 0 and V are uniquely determined by the 
properties 

2.1 Dfn (x)~(x e 0), 

2.2 Dfn {x)x e V. 

As a consequence of Axiom B5, the axiom of the direct product, and 
B6, the axiom of the inverse relation, we have 

9 | 2.3 (A)(3B)(x,y)[(x,y) e B. = .x e A]. 

The following three theorems are also consequeces of B5, B7, and B8. 

2.31 (i4)(3£f)(*, y, z)[(z, x,y) c B . = . (x,y) e A] 

2.32 (A)(3B)(x, y, z)[{x, z,y) e B . = . (x,y) e A] 

2.33 {A)(3B)(x, y, z)[{x, y,z) e B . = . (x,y) e A] 

For example, the first of these theorems is proved by substituting an ordered 
pair for the second member in the ordered pair appearing in B5, rewriting 
the variables properly. The other two are obtained by applying to 2.31 the 
axioms of inversion (B7 and B8). 

Substituting (x\,X2, ■ ■ ■, x n ) for x in B5 in a similar way, we get 
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2.4 (A)(3B){y,x 1 ,...,x n )[{y,x 1 ,x 2 ,...,x n ) eB. = . 

(xi,...,x n ) e A]. 


From this, by iteration, 

2.41 (A)(3B)(yi,... ,y k ,xi,... ,x n )[{yi,... ,y k ,xi, ■ ■ ■ ,x n ) e B . = . 

(, x \,..., x n ) e A ]. 

Similarly, 

2.5 (A)(3B){y u ■■■,Vk,x i,..., x„)[(a;i, j/i,..., y k , x 2 ,..., x n ) e B. =. 

(ari,...,a: n ) e A], 

This may be obtained by iteration from the case k = 1, and this case in 
turn is a special case of 2,32 obtained by substituting (x 2 ,...,x n ) for y 
and applying Theorem 1.16. 

The following theorems are derived in an analogous fashion, by substi¬ 
tuting (yi,.. .,yk) for z and y respectively in 2.33, 2.3, and applying 1.16, 

2.6 (A)(3B)(xi,x 2 ,y 1 ,... ,y k )[(xi,x 2 ,yi,... ,y k ) e B. = . 

{xi,x 2 ) e A], 

2.7 (A)(3B)(x,y 1 ,...,y k )[(x,yi,...,y k ) eB. = .xe A], 

The next (and for the present, the last) theorem is a generalization of 
Axiom B4, the axiom of the domain, and is obtained by substituting, in 
B4, {x 2 ,... ,x n ) for x. 

2.8 {A)(3B)(x 2 , .... 3t„)[(i 2l ...,x n )eB. = . (EaOKari eAj| 

In particular B = S(A) satisfies this equivalence. 

In the proof of the general existence theorem, it can be assumed that 
none of the special classes Aj appears as the first argument of the e-relation, 
because A» e F can be replaced by | (3x)(x = Ai.x e T) (by Axiom A2) 
and x = Ai can be replaced by (u)[u e x = u e Aj] (by Axiom A3). 

The proof of Ml is an inductive one, the induction taking place on the 
number of logical operators in c/>. 

Case 1. 0 has no logical operators. 

In this case 0 has one of two possible forms, x r e x s and i f e Aj, where 
1 < r, s < n. If 0 is of the form x r e x s , we must show that there exists 
a class A such that (sq,... ,x n ) e A. = ,x r e x s . If r = s, take as A the 
null class 0, since, by 1.6, ~(av e x r ). If r / s, 0 must be either of the 
form x p e x q or x q e x p , where p < q. For x p e x q , Axiom B1 provides for 
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the existence of an F such that (x pi x q ) e F. = .x p e x q . For x q e x p , B1 
followed by B6 provides for the existence of an F such that 

(x p ,x g )eF. = .x q ex p . 

Therefore, in either case there is an F such that 

(x p ,x q ) e F . = .<p(xx n ). 

Now, by 2.6, there is an F\ with the property: 

(x p , x q , x q+1 ,..., x n ) e Ft. =. ( x p , x q ) € F. 

Then by 2.5 there exists F 2 such that 

(x p ,...,x n ) e F 2 . = .(x p ,x q ,x q+ i,...,x„) efi, 

and finally, by 2.41 there exists a class A such that 

(*!,...,*») eA. = .{x p ,...x n ) eF 2 . 

Combining these equivalences the result is: 

(xi,...,x n ) eA. = .<p(xi,...,x n ). 

Now suppose (j) is of the form x r e A*. By 2.3, there is an F such 
that (x r ,x r+ i) e F. = .^(xi,...,x n )- (If r = n, use Axiom B5 to get 
(x r -i,x r ) e F . = . ij>(x 1,..., x n ).) Now, as above, by means of Theorems 
2.6 and 2.41, combining the resulting equivalences establishes the existence 
of A. 

Case 2. <j> has m logical operators (m > 0). 

Then <f> has one of the following three forms: 

(a) ~V>; ( b) ip. x; (c) (3:r)0. 

The hypothesis of the induction is that, for all ppfs ip(xi,...,x n ) with 
mi < m logical operators and such that no Aj appears in the context 
Aj e T, there exists an A with the properties required by the theorem, (p, 
X, and 9 are ppfs with fewer than m logical operators, xp and x have no 
other free variables than at most x%,... ,x n , whereas 9 has no other free 
variables than at most x,x\, ... ,x„, and Ai cannot appear in the context 
A, e T in xp, x or because it does not appear in <p in this context. 



Continuum hypothesis 


43 


Therefore, by the hypothesis of the induction, there exist classes B, C, D 
such that 

{xi,...,x n ) t B , = .ip(x i,...,x n ), 

(xi,...,x n ) eC. = .x(x 

(x,xi, .. .,x n ) e D , = .9{ x,xi,.. .,x n ). 

For (a) take A as -B, since, by Axiom B3, 

(xj,... ,x n ) e —B. = .~((xi,.. .,x n ) e B), 

\ so that (xi,... ,x n ) e —B. = .~\l>(x i,... ,x n ), that is 
(xi,...,x„) e -B. = .<i>{x i,...,x n ). 


For (b) take A as B ■ C, since by Axiom B2, 

(®i,... ,x n ) e B ■ C . = : (xi,... ,x„) e B. (xi,... ,x„) e C, 


that is, 


(xi,..., x n ) e B ■ C . =: ^>(xi,..., x„). *(xi,..., x n ); 


therefore 

(xi,...,x„) eB-C. = .4>(x 1 ,...,x n ). 


For (c), take A as the domain S(-D), since by Theorem 2.8 


therefore 
so that 


(xi,-..,x„) e £>(£)). = . (3x)[{x,x!,...,x n ) e D]\ 

(xi,..., x n ) e D ( D ). =. (3x)0 (x, xi,..., x n ), 
(xi,... ,x„) eS)(D) . = .<j>(xi,... ,x n ). 


This completes the proof of the general existence theorem for primitive 
propositional functions. 

The general existence theorem is a metatheorem, that is, a theorem about 
the system, not in the system, and merely indicates once and for all how 
the formal derivation would proceed in the system for any given ppf. 

So far, the existence theorem is proved only for ppfs; but the use of sym¬ 
bols introduced by definition yields a wider class of propositional functions 
for which it would be desirable to have the existence theorem valid. With 
this in view, examine the defined symbols introduced thus far. They may 
be classified into four types, as follows: 


Particular classes: 0, V,... 
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2. Notions : £W(X), ?JJt(X), iln(X), X C Y,. . ., 

3. Operations: -X, £> (X), X ■ Y ,..., 

4. Kinds of variables: x, X,... (defined by notions). 

Henceforth it is to be required that all operations and notions be mean¬ 
ingful, that is, defined, for all classes as arguments. This has been the 
case hitherto except for the pairs { x , y} and (x, y), and the n-tuples, which 
were defined for sets only. The extension for classes as arguments can be 
accomplished simply by replacing the free set-variables by class-variables 
in the definitions, i.e., 

3.1 Dfn («){ue{X,Y}. = :u = X.V.u = Y], 

3.11 Dfn {X} = {X,X}, 

3.12 Dfn (X,Y) = {{X},{X,Y}}, etc. 

By these definitions, e.g., {X,Y} is either {X, Yj or {X} or {Y} or 0 
according to whether both or one or none of X, Y are sets. 10 The same 
procedure of extension is to be applied in Definitions 4.211, 4.65, 6.31, 7.4, 
where the notions (or operations) under consideration are originally defined 
only if certain arguments are sets. 11 

12 | The following metamathematical ideas will be useful. A term is defined 

inductively so that (1) any variable is a term, and any symbol denoting 
a special class is a term; (2) if 2t is an operation with n arguments and 
IT,... ,r n are terms, then 21 (IT,..., F n ) is a term; (3) there are no terms 
other than those obtainable from (1) and (2). If 95 is a notion with n argu¬ 
ments and IT,..., T„ are terms, then 95(Ti,..., T n ) is said to be a minimal 


10 [Note added in 1951 : One may wish, for aesthetic reasons, that in analogy with 
Axiom A2 one should have ( X,Y) e Z.D . SOT (X). XI (Y). This can easily be accom¬ 
plished by replacing in Dfn 3.1 u = X by u = X V ppt(X). u e X ], and likewise u = Y 
by u = Y V pPr(y) .u e Y], If this definition is adopted, m(A‘x) can be dropped in 
Dfn 4.65. Otherwise it is indispensable, as was noted by Mr. W. L. Duda, who called 
my attention to its omission in the first edition. It is not difficult to define {X, Y } in 
such manner that 1.13 also holds for proper classes, but since there is never any occasion 
of making use of this fact there is no point in doing so.] 

n Note that in all these definitions it is absolutely unimportant how the notions 
or operations under consideration are defined for proper classes as arguments. 12 The 
only purpose of defining them at all for this case is to simplify the metamathematical 
concepts of “term” and “propositional function” defined on page 12 and the formulation 
of Theorems M2-M6. 

12 [Note added in 1951: A similar remark applies to many other concepts which by 
their usual definition are meaningful only for certain classes, e.g., <£m>, Con, etc. only for 
classes of pairs; 9Jt oj, £im only for sets of ordinals (with or without greatest element, 
respectively), etc. All that is aimed at in the subsequent definitions is that, for those 
arguments for which, by their usual definitions, the concepts defined are meaningful, the 
definitions given should agree with the usual ones. For OTor and £im e.g., this requirement 
can be satisfied by setting them both equal to S (cf. Dfn 7.31).] 
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propositional function or minimal formula. A propositional function may 
be defined recursively as any result of combining minimal propositional 
functions by means of the logical operators: V, ., D, = and quantifiers 

for any kind of variables. 

For each of the four types of symbols there is a corresponding kind of 
definition. 

1. A special class A is introduced by & defining postulate <I>(A), where 0 
is a propositional function containing only previously defined symbols, and 
it has to be proved first that there is exactly one class A such that 0(A). 

2. A notion *8 is introduced by the stipulation 

®(X 1 ,...,A n ) = 0(X 1 ,...,X n ), 

where 0 is a propositional function containing only previously defined sym¬ 
bols. 

3. An operation IX is introduced by a defining postulate 

{X 1 ,...,X n )<j>(*(X 1 ,...,X n ),X 1 ,...,X n ), 

where 0 is a propositional function containing only previously defined sym¬ 
bols, and it has first to be proved that 

(*!,..., X n )(E\Y)<My, Xi,..., X n ). 

4. A variable p is introduced by a stipulation that for any propositional 
function 0, (p)0(p) means 


(X)\*(X)D4>{X) 3, 

and (3p)0(p) means 

(3X)[Q3(X).0(X)], 

where IB is a previously defined notion, the extension of which is called the 
range of the variable y. 

Special classes, notions and operations are sometimes referred to by the 
common name “concepts”. 13 

All definitions so far introduced are of this type: 03 is called a normal 
notion if there is a ppf 0 such that 

03 (X 1 ,...,X n ). = .<f>(X 1 ,...,X n ), 


13 [Note added in 1951: The term “concept” only applies to notions and operations. 
Special classes should rather be called “objects”.] 
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21 is called a normal operation if there is a ppf <p such that 
Y e^{X 1 ,...,X n ). = .<P(Y,X 1 ,...,X n ), 

13 and a variable is called normal if its range consists of the | elements of a 
class. The propositional function <p(Xi ,..., X n ) is called norm,al if it con¬ 
tains only normal operations, normal notions, and normal bound variables; 
and a term is called normal if it contains only normal operations. 

M2. Any normal propositional function is equivalent to some ppf, 
and therefore Ml holds also for any normal propositional function 

<P{x i,...,x n ). 

Proof: Let 4 ,..., X n ) be the given normal propositional function. 
Since <p contains only normal bound variables, all bound variables 
not set variables can be replaced by set variables, e.g., (3y)x(y) by 
(3s)[® e A . x(®)], where A defines the range of the variable y. Next, for any 
notion 21 occurring in <p, since it is normal, the minimal propositional func¬ 
tion 2L(r x , ..., r n ) can be replaced by the equivalent ^(Id, •.., T„), where 
i>(Xi ,..., X n ) is a ppf. Then the only remaining notion is the e-relation. 
Again all contexts of the form T e A, where T is not a set variable, can be 
removed by the method explained on page 10 after Theorem 2.8, leaving 
only minimal formulas of the form u e T. But T, if not a variable or a special 
class, is of the form ©(Id,... ,r„), where 05 is a normal operation. But 
u e ©(Ti,... ,r„) can be replaced by ip(u,T i,...,T n ), where the ppf ip 
is such that u e ©(Id,...,T„). = . tp(u, Ti,..., T n ). In this way, <p is re¬ 
duced, getting all operations out. The final result of such reductions can 
be nothing other than a ppf. 

This completes the proof that Ml is valid for normal propositional func¬ 
tions. It remains only to verify that all concepts introduced so far are 
normal. This will be done by constructing for each of the correspond¬ 
ing expressions Ye 21 (Ad,..., X n ) and ©(Ad,..., X n ) equivalent propo¬ 
sitional functions containing only notions, operations and bound variables 
previously shown to be normal. These propositional functions are then 
equivalent to ppfs by Theorem M2. 

X e Y; e is normal, since X e Y is itself a ppf. 

X = Y . = .(u)[ueX , = .ueY] 

VJl(X)=.(3u)(u = X) 

?Pc(A‘). = .~QK(X) 

Z e {X,Y}. = : (Z = X.V.Z = Y).Tl(Z) 

Z e (Ad Y) . = . Z e {{A - }, {X,Y}} and similarly for triples, etc. 

X C Y . = .(u){ueX .D.ueY) 

X CY . = .(u){ueX ,D.ueY).~(X = Y) 
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iin(X) . = . (u,v,w)[(u,v) e X . (w,v) e X :D.m = w\ 

X e(-A). = :VJl(X).~(X e A) 

XeAB.= :XeA.XeB 
XeV(A). = :m{X).{3y)[(y,X)eA] 

<Sm(X).=.~(3u)(ueX) 

<%(X,Y). = . ~(3u)(u eX.ueY) 

The general existence theorems Ml, M2 (and likewise the later | Theo- 14 
rems M3-M6) are frequently used in these lectures without being quoted 
explicitly. 

The particular classes Ai ,..., Ak that may appear in the normal propo¬ 
sitional function 4>(xi, ... ,a; n ) are entirely arbitrary, and may therefore be 
replaced by the general class variables X\,...,Xk, so that the existence 
theorem takes the form 

M3. X k )(3A)( Xl , *„)[<*!,..., x n ) e A . =. 

if <j> is normal. 

The definitions that follow are mostly based on the existence theorem 
in this form. In each application of M3 it is apparent upon inspection that 
0 is normal. 

The direct (outer) product Ax B is defined by the postulate: 

4.1 Dfn (x)[x e Ax B . = .(3y,z)[x = (y,z) : y e A.z e B]\. 

A and B are considered as the constant classes in this application of M3, 
which assures the existence of A x B for all A and B. That Ax B is unique 
is guaranteed by the axiom of extensionality. 

4.11 Dfn A 2 = AxA 

4.12 Dfn A* = Ax(A 2 ) 

A 4 ,A 5 ,... are defined similarly. Thus V 2 is the class of all ordered pairs, 

V 3 is the class of all ordered triples, etc. Since every triple is a pair, it 
follows that 

4.13 V 3 C V 2 . 

Relations are to be defined as classes of ordered pairs, triadic relations 
as classes of ordered triples, etc. 


4.2 Dfn md(X). = .X <ZV 2 , 
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4.21 Dfn 9tel 3 (X).EE.X C V 3 , 

and similarly for all n > 2. “Stel(X)” may be written as “Stel 2 (X)”. 

If A is a relation, then (x, y) e A is read “ x bears the relation A to y ”, 
and may be written xAy, i.e., 

4.211 Dfn xAy . = .{x,y) e A. 

Relations can be thought of as many-valued functions, so that xAy may 
be read also as “ x is a value of A for the argument y” or “x is an image of 
y by A" , or u y is an original of x, with respect to A" . As a corollary of the 
axiom of extensionality, there is a principle of extensionality for relations: 

4.22 9tel(X).9tel(r) :D: (u,v) e X . = .(u,v) eY].D.X = Y. 

15 | The extensionality principle for relations holds also for n-adic relations, 

in a similar manner. As a result, the existence theorem takes the form: 

M4. Given a normal propositional function 0(xi, ..., x n ), there is exactly 
one n-adic relation A such that 

(xi,...,x n )[(x 1 ,...,x„) eA. = .<f>(x i,...,x„)]. 

The proof is immediate. Take an arbitrary class A! satisfying the condi¬ 
tion, and take A as A! • V n . A is an n-adic relation and is unique because 
of the principle of extensionality, 4.22. 

A, as defined by M4, is denoted by xi,..., x n [0(xi,..., x n )]. If «i,..., 
a n are normal variables, &i,..., a n [<j)(ai, ..., a n )] is by definition the same 
as xi,..., x n [<j>(x 1,..., x n ) .xi e C,... ,x n e C], where C is the range of the 
variables a*. (Note that the symbol * belongs to none of the four kinds of 
symbols introduced on page 11; therefore it must not be used in definitions 
or in applications of M2-M6.) 

The e-relation E and the identity relation I may be defined by means 
of M4. 

4.3 Dfn mel(E).(u,v)[(u,v) e E. = .uev] 

4.31 Dfn 91e[(J). (u,w)[(u,u) e /. = .« = u] 

I is the class of all pairs (u,u). 

The following definitions 4.4, 4.41, 4.411 of the converse relations corre¬ 
spond to the axioms B6, 7, 8. 

4.4 Dfn fftel [€nt> (X)] . (u, u)[<u, v) e <Em>(X). =. (v, u) e X] 

4.41 Dfn 9tel 3 [£no 2 (X)] ■ («, v, w)[<«, v , w) e €w 2 (X). =. 

{v,w,u) eX] 
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4.411 Dfn SHel 3 [£tu> 3 (X)]. (u,v,w)[(u,v,w) e £nt> 3 (X) . = . 

(u,w,v) e X ] 

4.412 Dfn <£nt) (X) is also denoted by Cno^X), X -1 , and X. 

The binary Boolean operations “+” and ” are defined in terms of 

and the complement : 

4.42 Dfn X + Y = -[(.-X) ■ (-!")], 

4.43 Dfn X -Y = X -(-Y). 

4.44 Dfn 2B(X) =D(X- 1 ). 

2U (X) is called domain of values of X. 

The relation “ A confined to B" is written “A1 B”. 

4.5 Dfn Al B = A-{V x B) 

Al B consists of all elements of A which are ordered pairs with second 
member from B. In that sense, “A I B" is “A confined to B ”, since the 
arguments of A are restricted to lie in B. This gives the theorem: 

| 4.51 2)(A1B) = B-S)(A). 

4.512 Dfn 51A = A'(Sx V) 

4.52 Dfn B“X =W{Bt X) 

B“X is the class of all images by B of elements of X. 

4.53 Dfn 

4.6 Dfn 

iltt2 (X) means X is one-to-one, that is, the relation X V 2 is one-to-one. 
If X is a relation, and is single-valued, X is said to be a function. 

4.61 Dfn ?nc(X).= :91el(X).lln(X). 

A function X whose domain is A is called a function over A. 

4.63 Dfn X$nA. = : #nc (X) • 2) (X) = A 

A L x (the A of x) denotes the y such that {y, x) e A, if that y exists and 
is unique; if y does not exist or is not unique, A L x — 0. Hence the defining 
postulate for A l x reads as follows: 

4.65 Dfn (E\y)[(y, x) e A]. D. (A‘x, x) e A : 

~(Ely)[(y, i)eA].3. A‘x = 0 .: Wl (A‘x). 
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The extensionality principle for relations (4.22) gives the following ex- 
tensionality principle for functions: 

4.67 X$nA.Y3nA :D: {u)[ue A.D:X‘u = Y‘u].D.X = Y. 

M5. If tp(ui,..., u n ) is a normal term, if B C V n and if 

(m,..., u n ) e B. D . 9JI (ip(u !,..., u n )), 

then there exists exactly one function C over B such that 

C l (ui,..., u n ) = ip(ui ,... ,u n ) for {ui ,... ,u n ) e B. 

Proof: Define C by the condition: 

(u, u u ..., u n ) e C. =: u = tp{u a ,..., u n ). (ui, ...,u n ) eB. 

Since the right hand side is normal, there is an (n + l)-adic relation C 
satisfying the condition, by M4. C obviously satisfies the conditions of the 
theorem. 

M5 may be generalized as follows: 

M6. If Bi,...,Bk are mutually exclusive, Bi C V n , and if ipi,...,ipk 
are normal terms such that ^(^(ui,..., «„)) for (ui,...,u n ) e Bi, then 
there exists exactly one function C over Bi + B 2 + • • ■ + B n such that 
17 C l (u 1,... ,u n ) = ... ,u n ) for | («i,... ,u n ) e Bi, i = 1,2,..., k. 

We now define five special functions Pi,..., P5 by the following postu¬ 
lates: 


4.71 

Dfn 

Pi 

(x, y) = 

= x.Pi$nV 2 , 

4.72 

Dfn 

P 2 

(*>w) = 

= y.P 2 $nV 2 , 

4.73 

Dfn 

Pi 

(x,y) = 

= (y,x).P 3 dnV 2 , 

4.74 

Dfn 

pi 

{ x,y,z 

) = {z,x,y).P i d nV 3 , 

4.75 

Dfn 

Pi 

( x,y,z ; 

) = (x,z,y).P 5 $nV 3 . 


Existence and unicity of Pi,..., P5 follow from M5. 

4.8 Dfn u e &(X) . = . (3u)[u e v .v e X] 

&(X) is called the sum of X. The following results are immediate: 

4.81 &{x,y} = x + y, 

4.82 6{a:} = x, 

4.83 &{X) = E“X. 
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Now define fppf), the power class of X, the class of subsets of X. 

4.84 Dfn ue%(X). = .uCX 

Some of the operations defined have monotonicity properties, e.g., 

4.85 XCy.D:fl(I)CS(r). 

It is easily verified that 20, ©, and £nt>j have similar properties. Also 

4.86 AC B .X CY :D. A“X C B“Y. 

1, and x have similar properties. 

We also have some distributivities, such as 

4.87 (A x B) ■ (C x D) = (A • C) x {B ■ D). 

This leads to the special case 

4.871 {AxV)-{V xB) = AxB. 

Likewise 

4.88 epf + y) = ©(x) + ©(y), 

4.89 6(X ■ Y) C &(X) • ©(y). 

| The following theorems result from Definitions 4.71-4.75, and are im- 18 
mediate upon inspection. 

4.91 20 (A) = Pi A 

4.92 D (A) = P£A 

4.93 £nt>(A) = P$A 

4.94 £tu> 2 (A) = P^ A 

4.95 €no 3 (A) = P£A 

4.96 V x A = P£A 

The proof for the normality of the notions and operations introduced above 
and also of those introduced later is contained on page 62. 

The results obtained thus far depended on the first two groups of axioms. 
Theorems on the existence of sets depend, however, on the later axioms. 

The following theorem depends on Axiom C4, the axiom of substitution. 


5.1 
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Proof: Since 93?(X), there is a set y, by C4, whose elements are just 
those sets which bear the relation A ■ V 2 to members of X, that is, 
(a) [a e y. = .u e A“X], so that, by the axiom of extensionality, y is 
identical with A K X. Therefore 93? (A “X). 

5.11 93?(X).D.93?(X-y) 

Proof: Substitute I[Y for A in 5.1, obtaining 93?[(J 1 y)“X]. But 
(IfY) u X=X-Y. 

5.12 93?(X).y CX:D.93?(y) 

Proof: y CXo.y = X Y. Now, by 5.11, the theorem is proved. 

5.121 93?(X).D.93?(<P(X)) 

Proof: Axiom C3 provides for the existence of a y such that <}3(X) C y. 
Therefore by 5.12, 991 (<P(X)). 

5.122 93?(X)o.93?(6(X)) 

Proof: This is proved similarly by using Axiom C2 and 5.12. 

5.13 93?(X).93?(y) :D.93?(X + y) 

Proof: If X, Y are sets, we have X + Y — &{X,Y}, and, by Axiom A4, 
{X, y} is a set. Therefore by 5.122, 9Jl(X + Y). The next three theorems 
are proved by 5.1 using 4.91-4.95. 

19 | 5.14 93l[S)(x)] 

5.15 93l[£no,;(:r)] (2 = 1,2,3) 

5.16 m[W(x)] 

From 5.14 and M5 it follows that there is a function Do such that: 

5.17 Dfn Dd'x — 2)(x). DofinV. 

5.18 931 (a; x y) 

Proof: The members of x x y are the pairs {u,v), where u e x, v e y. In 
particular, then, u and v are elements of x + y, so that {a} and {a, a) are 
subsets of x + y. Therefore {{a}, {a, a}} is a subset of ^(.r + y), that is, 
(a, a) C ^{x + y), so that (a, a) e + y)], i.e., ixyC + y)]. 

Therefore 93T(a; x y), by 5.121 and 5.12 and 5.13. 
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5.19 F$nx.D.m(F) 

Proof: F$nx .3. F C (F“x) x x , therefore 9Jt(.F), by 5.1, 5.18, 5.12. 

5.2 Un{F).3.m{Ftx) 

Proof: F 1 x is a function over D(F 1 x), and D(F I'iJCi, hence D(F 1 x) 
is a set. Hence the theorem holds by 5.19. 

5.3 971(0) 

Proof: 0 C x, therefore 971(0), by 5.12. 

5.31 -971 (Y) 

Proof: x e V; therefore if 9J1(Y) we would have V e V, but this is impossi¬ 
ble, by 1.6. 

5.4 <pt(X).D.q3t(6(X)) 

Proof: Suppose 97t(6(X)); then 97t(^3(6(X))), but X C ^3(6(X)), there¬ 
fore 9J1 (X), contrary to the hypothesis. 

Similarly: 

5.41 q3t(X).D.q3t(<p(X)), 

5.42 <Pt(X).D.<Pt(X + Y), 

5.43 <Pt(X).~©n(Y) :D.<Pt(X x Y). 

Proof: X C 6[6(X x Y)], if Y ^ 0. 

5.44 Uti2(F).XCS(f) : 3 : q3t(X). D. (F“X), 

| that is, a one-to-one image of a proper class is a proper class. The proof 
follows from the fact that X C F“(F“X), if X C T)(F). Therefore, if F"X 
were a set, X would also be a set by 5.1 and 5.12. 

5.45 q3t(A).I>.<pt(X-a:) 

This follows from the inclusion A C {A — x) + x, and 5.13. 
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21 | Chapter III 

Ordinal numbers 

Ordinal numbers may now be defined, with the aid of some preliminary 
definitions. 

6.1 Dfn YdonX . = .X 2 CY + Y- 1 +1, 

that is, Y is connex in X if, for any pair of distinct elements u. v of X, 
either ( u , v) e Y or ( v , it) e Y. 

6.11 Dfn Y is called transitive in X if, for all elements u, v, w of X , 

{u,v) e Y. (v,w) eY :D. {u,w) e Y. 

6.12 Dfn Y is called asymmetric in X if, for no elements u, v of X, 

(it, v) e Y. (v, it) e Y. 

6.2 Dfn XWeY ,= :Y€onX .(U)[U ^O.U CX:D 

. (3u)[i; eU.U- Y“{v} — 0]] 

that is, X is well-ordered by Y if Y is connex in X and any non-void 
subset U of X has a first element with respect to the ordering F, since 
U • F“{i>} = 0 says that there is no member of U which bears F to v. 
Note that the symbol XWtY here introduced is not normal, because of 
the bound variable U. u 

6.21 If X 2UeF, then F is transitive and asymmetric in X. 

Proof: F is asymmetric in X, since if xYy and yYx the class {x, y} has no 
first element. In order to prove the transitivity in X, suppose xYy and yYz: 


14 [Note added in 1951: The statements made after Dfns 6.2 and 8.1, and on page 
62, to the effect that 2ZJe and ~ are not normal are incorrect, if normality is defined as 
on page 12. According to this definition normality of a concept has nothing to do with 
the way in which it is defined but only depends on its extension. Therefore all that, 
prima facie, can be said about Wt and ~ is that they cannot be proved to be normal by 
the method applied to the other concepts on page 62. They can however be proved to 
be normal in a different way, provided the axiom of choice is assumed. For, under this 
assumption, it can be proved that 

X ~ Y- = .X y.V : 'Pt(X).<Pr(y) 

(cf. von Neumann 1929). Moreover U can be replaced by u in Dfn 6.2 because the 
existence of a class without first element implies the existence in it of a descending 
sequence of type w. The latter proof requires the singling out of one element in every 
non-empty class, which however can be accomplished by considering, in every class, the 
subset of elements of lowest “Stufe” (in the sense of von Neumann 1929, page 238).] 
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then x / z because of the asymmetry, hence either xYz or zYx. Consider 
U = {a;} + {y} + {z}. If zYx, U will have no first element, therefore xYz. 

6.3 Dfn X6erfiiy. = :Xcr.[r.(tf“X)CX], 

that is, X is an R-section of Y if all //-predecessors in Y of members of X 
also belong to X. 

6.30 Dfn X is called a proper //-section of Y if it is an //-section of 
Y and ± Y. 

| 6.31 Dfn 6eg n (X,u) = X- R“{u}, 

that is, if u e X, the R-segment of X generated by u is the class of elements 
of X which are //-predecessors of u. 

6.32 &eg R (X,u) is an //-section of X, if u e X and if R is transitive 
in X. 


Therefore 

6.33 If X WeR, then any //-segment generated by an element of X is 
an //-section. 

Conversely, if X WeR and Y is a proper //-section of X, then Y is an 
//-segment of X , namely the one generated by the first element of X — Y. 

If R is a one-to-one relation with domain A and converse domain B, 
then R is called an isomorphism from A to B with respect to S and T if 
for any pair u, v of A such that uSv the corresponding elements of B are 
in the relation T. and conversely, i.e., 

6.4 Dfn R 3fom S)T 04, B). =: iln 2 (/Z) .SKet (//).©(//) = A. 

W (ZZ) = B. (u, »)[u e A . v e A : D: uSv . =. {R l u)T(R'v)]. 

If there exists an isomorpism from A to B with respect to S and T, A is 
called isomorphic to B with respect to S and T. If S = T in 6.4, R is said 
to be an isomorphism from A to B with respect to S. 

6.41 Dfn R is called an isomorphism with respect to S if it is an 
isomorphism from ®(R) to 2U(ZZ) with respect to S. 

“Isomorphism with respect to an n-adic relation S” is defined accordingly. 

The method to be used in constructing the ordinals is due essentially to 
J. von Neumann. The ordinal a will be the class of all ordinals less than 
a. For instance, 0 = the null set, 1 = {0}, 2 = {0,1}, ui = the set of 
all integers, etc. In this way, the class of ordinals will be well-ordered by 
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the e-relation, so that a e /? corresponds to a < /?. Any ordinal will itself 
be well-ordered by the e-relation, since an ordinal is a class of ordinals. 
Moreover, any element of an ordinal must be identical with the segment 
generated by itself, since this segment is the set of all smaller ordinals. 
These considerations lead to the following definition: 

Definition: X is an ordinal if 

1. XWtE, 

2. ueX :D.u = Gtg E {X,u). 

23 | However, as shown by R. M. Robinson {1937, page 35; Bernays showed 

previously that transitivity of E in X and 2' are sufficient), conditions 1 
and 2 may be replaced, owing to Axiom D, by the weaker conditions: 

1'. E€onX, 

2'. ueX.D.u CX. 

X is said to be complete if it has the property 2', i.e., if any element of an 
element of X is an element of X, that is, 

6.5 Dfn Comp(X). = .(u)[ueX.D.uCX]. 

6.51 Comp(X). = .6(X)CX 

The proof is immediate from 6.5 and 4.8. 

6.6 Dfn DtO(X). = : Comp {X).E Con X 

This definition combines conditions 1' and 2'. An ordinal which is a set is 
called an ordinal number, denoted by 0{X). 

6.61 Dfn D(X). = :Dtl>(X).9tt(X) 

The class of ordinal numbers is denoted by On. (Concerning the normality 
of DrO, cf. page 62.) 

6.62 Dfn xeOn. = .D(x) 

Dfn The letters a,/3, 7 ,... will be used to denote variables whose 
range is the class of ordinal numbers. Evidently these variables 
are normal. 

6.63 Dfn X <Y . = .X eY 

6.64 Dfn X <Y. = :X <Y.V.X = Y 

6.65 Comp (X). Comp {Y) : D: Comp (X -I- Y) . Comp {X ■ Y) 

Proof: By 4.88, &{X + Y) = 6(X) + 6(F). Therefore, by 6.51, we have 
Comp (X + Y). Similarly for X Y by 4.89. 
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The next step is to show that the definition 6.6 is equivalent to the 
stronger definition, i.e., 

6.7 1. £)tO{X)D XWtE, 

2. DrO (X). u e X Du — &tg E (X, u). 

Proof of 1: Given any non-void subset Y of X, there exists u, by Axiom D, 
such that ueY and Y ■ u = 0, that is, Y ■ E“{u} = 0, since u = 6[{u}] — 
E“{u} by 4.83, 4.82. Therefore XWeE, by Definition 6.2, since EdonX, 
by definition of DtZ>. 

Proof of 2: If Dth (X) and u e X, then &eg E (X, u) = X-E“{u} = X-u = u, 
by Definition 6.31 and the completeness of X. 

| 7.1 £>«(*).T Cl :D: Comp (T). D. y e X 

Proof: &(Y) C Y, so that E“Y C Y by 4.83. Therefore, by Definition 6.3, 
Y is a section of X. Hence by 6.33 Y must be a segment of X, generated 
by some element u of X. But then T = it, by 6.7, hence Y e X. 

7.11 Ot0(X).Otd(Y) :D:Y CX. = .YeX 

Proof: Since Y is an ordinal, it is complete. Therefore 7.1 establishes one 
half of the equivalence. The other half merely expresses the fact that X is 
complete, since Y = X is excluded by 1.6. 

7.12 If X and Y are ordinals, one and only one of the following rela¬ 
tions holds: 

X e Y, X = Y, YeX. 

Proof: X Y C X and X Y C.Y. Suppose now that X Y C X and 
X Y C Y ; then X YeX and X ■ Y e Y, by 7.1, since the intersection of 
two complete classes is complete (6.65). But this implies that X Y e X-Y, 
which is impossible, by 1.6 and Axiom A2. Therefore either X Y = X or 
X -Y =Y, i.e., either Y C X or X C Y, i.e., X C Y .V. X = Y. V. Y C X, 
hence XeY.V.X — Y.V.YeX by 7.11. Therefore at least one of 
the three relations holds. Moreover no two can hold simultaneously, since 
X e X or X e Y .Y e X are impossible, by 1.6 and 1.7 and Axiom A2. 

7.12 and 6.63 express the fact that any two ordinals are comparable. By 
6 .1, this implies the statement: 

7.13 E <Con On. 

7.14 Ox0(A).D.ACOn 
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Proof: Let A be an ordinal and x an element of A. We have to show 
that EConx and Comp (a;). Take z e y, y e x; then, since A is complete, 
y e A; then iterating, z e A. E is a relation of well-ordering for A, therefore 
transitive in A by 6.21, so that z e x. Therefore x is complete. E Con A 
and x C A, so that E Conx. 

7.15 Comp (On) 

Proof: By 7.14, x e On. D. x C On. 

7.16 Drh (On) 

Proof: 7.13, 7.15, 6.6. 

25 7.161 On (and therefore any class of ordinal numbers) is well-|ordered 

by E. 

This follows immediately from 7.16 and 6.7 and allows us to prove prop¬ 
erties of ordinal numbers by transfinite induction, if the property under 
consideration is defined by a normal propositional function, since under 
this assumption the class of ordinal numbers not having the property ex¬ 
ists by M2 and (if not empty) contains a smallest element by 7.161 and 
Definition 6.2. By an inductive proof is always meant the reductio ad ab- 
surdum of the existence of a smallest ordinal not having the property under 
consideration. 

By 7.14, any element of an ordinal number is itself an ordinal number, 
so that an ordinal number x is identical with the set of ordinals less than 
x, recalling that the e-relation is the ordering relation for ordinals. 

7.17 fpt(On) 

Proof: On is an ordinal, so that On would be an ordinal number, if DJI (On), 
hence On e On, which is impossible (1.6). 

7.2 DrO (X) ,D:Ie On .V. X = On. The only ordinal not an ordinal 
number is On. 

Proof: By 7.14, X C On. If X C On, by 7.11, X e On. 

7.21 Any .E-section of an ordinal is an ordinal. 

Proof: Any proper .E-section of an ordinal X is (by 6.33 and 6.7(2)) an 
element of X, hence an ordinal by 7.14. A non-proper E-section of X is 
identical with X. 
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7.3 A C On. D. OtD [6(A)] 

Proof: 6 (A) is complete since, if x e © (A), there is an ordinal a such that 
x e a e A; then if y e x, y e a, since a is complete, that is y e &(A). Also 
E <£on6(A); for take x ^ y, elements of 6(A); then x e a e A, y e (3 e A. a 
and (3 are comparable so that either a C (3 or (3 C a. Then both x and y 
are members of the larger of a and (3, so that x e y or y e x, since ECona 
and E€o n/3, that is E £on6(A). Therefore £>tD[6(A)]. 

6(A) is the smallest ordinal greater than or equal to all elements of A, 
i.e., is either the maximum or the limit of the ordinals of A according as to 
whether there is or is not a greatest ordinal in A. Therefore we use “£im” 
and “9Jtap” to denote the same operation as 6. 

7.31 Dfn £im (A) = 6 (A) 

(A) = 6 (A) 

| 7.4 Dfn x + 1 = x + {x} 

This defines the successor relation for ordinal numbers as seen by Theorems 
7.41, 7.411. 

7.41 x + leOn. = .xeOn 
This is easily proved. 

7.411 ~(3/?)[a < /? < a + 1] 

Proof: Suppose a < /3 < a + 1; then /3 e a + 1, that is /3 e a + {a}; that 
is (3 e a or (3 = a, so that f3 <a. 

Ordinal numbers are to be classified into ordinal numbers of the first 
kind and ordinal numbers of the second kind, as follows: 

7.42 Dfn x e K{ . = : (3a)[x = a + 1]. V. x = 0. 

x is of the first kind if it is the successor of an ordinal number or 0. Oth¬ 
erwise x is of the second kind. 

7.43 Dfn K n = On-Kj 

7.44 Dfn 1 = 0+1 

7.45 Dfn 2=1 + 1 

Likewise 3 = 2 + 1, etc. Evidently we have: 

7.451 If m is a set of ordinal numbers, the ordinal a — 6 (m) + 1 is an 
ordinal number greater than any element of m. 
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It will now be shown that it is possible to define functions over On 
by means of transfinite induction, i.e., determining F l a by means of the 
behavior of F for ordinal numbers less than a. Since a is the class of 
ordinals less than a, F t a is F confined to arguments less than a. Therefore 
the induction should have the form F‘a = G‘(F[ a), where G is a known 
function. The following theorem, then, is what is needed: 

7.5 ( G){E\F)[F$nOn . ( ot)(F‘a = G‘(F 1 a))]. 

Proof: Let us construct F. First, by the existence theorem M2, there exists 
a class K such that: 

feK=. • (o)[a e (3.D. f l a = G‘(f T a)]]. 

Now set F = &{K). If /, g e K, where / $n/3 .3 ./3 < 7 , it follows 

that f = gl (3, because for a e (3 both / and g satisfy 

(*) f‘ a = G l (f T a)j 

and this equation determines an / over f3 uniquely, as is seen by an induc¬ 
tion on a. This means that any two f,geK coincide within the common 
part of their domains. Therefore F will be a function and its domain will 
27 be the sum | of the domains of all f t K (i.e., ’S(F) = &(Do“K)) and 
F will coincide with each f e K within 33 (/). F will satisfy (*) for each 
a e 33 (F), because a e 33 (F) implies a e 33 (/) e On for some f e K where 
/ satisfies (*) in 33 (/) and f = Fl"S(f). Now 33 ( F) is an ordinal by 7.3, 
but cannot be an ordinal number a because otherwise F could be extended 
to a function H over a 4- 1, by virtue of (*) and M 6 . But then Wl(H), 
by 5.19, hence H e K, which would imply a + 1 C a. The unicity of F 
follows by an induction on a. 

7.6 Dfn An ordinal function is a function G over an ordinal, with 
ordinal numbers as values, that is G Jno (for some a) or G-SnOn. 
and 2U (G) C On. 

7.61 Dfn An ordinal function G is said to be strictly monotonic if 
a < /3.D.G‘a < G l (3 for a, 0 e S)(G). 

By induction it follows that: 

7.611 If G is strictly monotonic, then G‘a > a for a e 33 (G). 

From this it follows that no two different ordinals X and Y can be isomor¬ 
phic with respect to E, 
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7.62 Dr0(X).OtJ)(y).Haf0TO BB (XF) :D: X = Y H = HX. 

Proof: By definition of an isomorphism we have: if a, /3 are elements of X 
such that a e /3, then H‘a e H l f3, that is, H is strictly monotonic, so that 
by 7.611 H l a > a for a e X. Likewise, > £T‘a, that is, a > H l a 

for a e X; it follows that H l a = a for a e X, in other words, X = Y, and 

h =/rx. 

As a consequence of 7.62, a well-ordered class can be isomorphic to 
at most one ordinal. Sufficient conditions for a well-ordered class to be 
isomorphic to an ordinal are given by the following theorem. 

7.7 1. If *$r(A) and AWeW, and if any proper Bisection of A is a 

set, then A is isomorphic to On with respect to W and E. 

2. If aWeW, a is isomorphic to an ordinal number with respect 
to W and E. 

Proof of 1: Let F l a be defined by induction as the first element of A 
which has not yet occurred as a value of F, that is F‘a = first element of 
A — 2D (F 1 a). In order to prove the existence of F by 7.5, this condition 
must be expressed in the form 

(*) F‘a = G‘(F la). 

Define G by the condition: 

(y,x)eG. = :ye(A- W (X)). (A-W(x)). W“{y} = 0, 

and define F by (*) and the condition FfinOn. Then G‘x e A — ZO(x') 
for any set x because A — 2D (x) is a proper class by 5.45, 5.16, | hence 
t^ 0. Therefore F l a e A — 2D (F f a) for any a by (*), hence 2T ( F) C A. 
Moreover F is one-to-one, so that QU (F), being a one-to-one image of the 
proper class On, is itself a proper class by 5.44. But W (F) is a section of 
A, hence by the hypothesis cannot be a proper section, i.e., 2D (F) — A. In 
addition, it is easily seen that a < /3 . = . (F‘a)W(F‘j3). 

Proof of 2: Construct G and F exactly as in the proof of 1, replacing A 
by a. Now it can be shown that s-2IJ(i i ’l'a) = 0 for some a. In fact, 
suppose that ( a)[a - W(Ft a) / 0]; then we could conclude, as before, 
that 2D (F) C a. Then W (F) would be, as before, a proper class; but this 
is impossible, since a is a set. Therefore (3a)[a — W (F a) — 0]. Then, 
if a is the smallest ordinal of this kind, F T a establishes the isomorphism 
between a and a. From the axiom of choice it follows: 

*7.71 For any set a there exists an ordinal number a and a one-to-one 
function g over a such that a = g u a. 
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Proof: By Axiom E, the axiom of choice, there is a function C over V such 
that x ± 0. D. C‘x e x. Define F by the postulate 

(a)[F‘a = C‘(a — 2U (F T a))] 

and F$nOn. Existence and unicity of F follow from 7.5, if first G is defined 
by G l x = C‘(a - 20(a;)), G3hV, using M5. As in the second part of 7.7, 
it is shown that there exists an a such that (a — 23J (F T a)) = 0. Then if a 
is the smallest ordinal of this kind, F I* a can be taken as g. 

It is desirable to assign a well-ordering for the ordered pairs of ordinal 
numbers: 

7.8 Dfn (a,0)Le('),6) . = : 0 < 6 .V. (0 = 8. a < 7)Le C (On 2 ) 2 , 

7.81 Dfn (a,/?)i?( 7 ,tf).= :9Hap{a,/?} < 97lc?{ 7 ,5} . V 

. [OTaj:{a, 0} = Orta^, 8} . (a, 0)Le( 7 , 6 )].: R C (On 2 ) 2 . 

The existence of an Le satisfying 7.8 follows from M4 since the relation Le 
defined by 

(x,y) e Le = ( 3 q,/ 3 , 7 , £)[a; = (a, 0). y = ( 7 ,6 ) : < 6 .V. (13 = 6. a < 7 )] 

evidently satisfies 7.8. Similarly for R. On 2 is well-ordered by R in such a 
way that: 

7.811 Any proper A-section of On 2 is a set. 

Proof: Consider a pair (fi,v) such that (p, v)R(a,(i)\ then 

ma^{n,v) < 9Jta?:{a,/?} < 9Jtqj:{a,/?} -i- 1. 

Therefore /i, v e [SWaj{a, 0} + 1], so that (/it, v) e a, where a - [£DTap{cv, 0} 
4- l] 2 . a is a set by 5.18. Therefore the class of all pairs (y, v) such that 
(y, 1s)R{a, 0) is contained in the set a, hence is itself a set. 

29 | Now, applying 7.7 (since On 2 is a proper class by 7.17, 5.43), we have: 

7.82 On 2 is isomorphic to On, with respect to R and E. Let the 
isomorphism from On 2 to On be denoted by P, i.e., 

7.9 Dfn P Jn On 2 .W(P) = On: 

(a, 0, 7 , *)[(<*, 0)R( 1 , S).D. P‘(a, 0) < P‘< 7 , 6)]. 

7.91 P l (a,0) > Ottawa,/?} 

Proof: Take 7 = 97loj{a,/?}. Then P l (a,0) > P‘{ 7 , 0 ) by 7.9; but, since 
P‘( 7 ,0) considered as a function of 7 is strictly monotonic by 7.9, we have 
7 < P‘( 7 , 0 ) by 7.611, i.e., P l (a,0) > SDlatfa, 0}. 
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Chapter IV 
Cardinal numbers 

We can now proceed with the theory of cardinals. Most of the theorems 
and definitions of this chapter (except those concerning finite cardinals) 
depend in our development on the axiom of choice, although its use could 
be avoided in many cases. Two classes X and Y are said to be equivalent 
if there is a one-to-one correspondence between the elements of each, i.e., 

8.1 Dfn X~Y. = .(3Z)[Un 2 (Z).m(Z).'D{Z) = X.W{Z) = Y]. 

This notion is not normal; 15 the corresponding normal notion is as fol¬ 
lows: 

8.12 Dfn X~'Y. = .(3z)[iiMz)-^(z).D(z) = X.W(z) = Y]. 
8.121 x~y. = .x~'y 

Proof: A class Z satisfying the right hand side of 8.1 for two sets X, Y is 
a set by 5.19. 

8.13 Dfn (x, y) e Aeq. = .x ~ y : 9\el(Aeq) 

The cardinal of X, denoted by X, is defined by the postulate: 16 

*8.2 Dfn x ~ f .1 e On. (a)[a < 1 .D.~(a ~ x)] .fpt(.X') . D. 

T = On. 

By Theorem 7.71 it is seen that X exists. The unicity is immediate. X 
is a normal operation, since 

Xcf.= : XeOn.(a)[astfY.D.Xea]. 

Hence by M5 there exists a function Nc over V such that N&x = W for any 
set x. 

*8.20 Dfn Nc i x = W.Nc3nV 

The cardinal of a set is called a cardinal number , i.e., the class N of 
cardinal numbers is defined by: 


15 [Note added in 1951: See Note 14 on page 21.] 

16 [JVote added in 1951: Dfn 8.2, for the case that tP r is justified by J. von 
Neumann’s result (concerning ~) quoted in Note 14.] 
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*8.21 Dfn N = W(Nc). 

*8.22 N COn 

This follows immediately from 8.2, 8.21. 

31 | An ordinal number is a cardinal number if and only if it is equivalent to 

no smaller ordinal, i.e., if it is an initial number in the usual terminology. 17 

The next five theorems are immediate consequences of the definition of 
cardinals. 

*8.23 x e N . = .x — x 

*8.24 I~x 

*8.25 x_~y. = .W = y 

*8.26 a < a 

*8.27 Nc l [Nc‘x] = N_c‘x 

*8.28 x Cy .D.x <y 

Proof: y ~ y, therefore there exists a z C y such that x ~ z. z is a 
set of ordinal numbers, hence well-ordered by E, hence is isomorphic to 
an ordinal number /? by 7.7, i.e., there is an h such that h^ornsE (/3, z). 
Hence a < h l a for a e f3 by 7.611. But_/i‘a e z C y for a £ /3. Hence 
a e /3. D. a < Wa e y, that is /3 C y. But (3 = z, therefore z = (3 < (3 <y. 

The Schroeder-Bernstein Theorem appears as a consequence. Namely, 
if x ~ t C y and y ~ u C x, then x — y, since W = t <y, and y = u < x. 
This proof depends, however, on the axiom of choice. 

The proofs of the next three theorems are omitted. 

*8.3 a 4- 1 < a 2 for a > 1 

*8.31 iln (A) .D.A“x <x 

*8.32 (x) > f (Cantor’s theorem) 

*8.33 «Pt(lV) 

Proof: Take m C N; then by 8.32 fp(6(m)) > &(m). But 6(m) > a, 
where a is any member of m by 8.28, 8.23. Therefore there is a cardinal 
number greater than any element of m; hence m / N, i.e., N can not be a 
set. 

We now define the class uj of integers: 

8.4 Dfn x e w. = .x + {x} C Ki, 


17 This treatment of cardinals is due 


Neumann (cf. 1928a, p. 731). 
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i.e., x is an integer if it is an ordinal number of the first kind and if all 
smaller ordinals are likewise of the first kind. It follows immediately that: 

8.41 atu.D.a + lew and a e uj . /3 < a :D. (3 e w. 

8.42 Dfn i, k are variables whose range is w. 

The principle of induction holds for integers: 

8.44 0 e A.(k)[ke A.D.k + lei]:D.wC A. 

| Proof: If u; C A is false, there must be a smallest i such that i is not 32 
a member of A. This leads to a contradiction with the hypothesis, since 
either i — 0 or i = k + 1 by 8.4 and 7.42. 

Functions over the class of integers may be defined inductively: 

8.45 (a, G){E\F)[F$nu . F‘0 = a . ( k)(F l (k + 1) = G‘(F l k))]. 

This can be proved either by specializing G in 7.5 or by arguments 
similar to those used in the proof of 7.5, 

8.46 i ^ k ,D.~(i ~ k ) 

This can be proved by induction on integers, since 
i + l~k + lDi~k. 

8.461 a/fc.D.~(a~fc) 

Proof by induction on k, since fc + 1 ~ a > u; would imply k ~ a. 

*8.47 i e N 

This follows from 8.46. 

A class is called finite if it is equivalent to an integer; otherwise infinite, 
i.e., 

8.48 Dfn Jin (x). =. (3 a)[a e u>. ax], 

8.49 Dfn 3nf (#). = . ~Jin (x). 

8.491 Jin (x). z C x O. Jin(,z) 

Jin (x) . Jin (y) : D : Jtn (x + y). Jin (x x y) 

This is proved by an induction on the integer i equivalent to x. 


8.492 Jin (a) , = .a e u) 
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This follows from 8.461. 

8.5 DrO (w) 

Proof: w is a class of ordinal numbers, hence E<tonu. Moreover, every 
element of an integer is an integer by Definition 8.4, that is, £omp(oj). 
Therefore DtO (w). 

8.51 SDt(w) 

Proof: Axiom Cl (the axiom of infinity) provides for the existence of a 
33 non-empty set b, such that for every x e b there is | a y e b which contains 
exactly one element more than x; namely take for b the class of all subsets of 
elements of the set a, whose existence is postulated by Axiom Cl (b is a set 
because b C %}[& (a)]). Now consider the class c defined by c = (w 1 Aeq) “ b , 
i.e., the class of integers equivalent to elements of b. c is a set by 5.1 
and 8.46, and w C c, as can be shown by induction owing to the above 
mentioned property of b. 

8.52 weK n 

Proof: rew.D.r + lew, by 8.41. If w = a + 1, we would have a e ui by 
7.4, hence a 4- 1 e lj, i.e., w e w, which is impossible. 

8.53 There exists an ordinal number of the second kind. 

Proof: By 8.52, u) is such an ordinal. 

*8.54 Dfn N' = N-u 
*8.55 N' C On 

*8.56 N' is isomorphic to On with respect to E. 

Proof: tyt(N') by 5.45, since Moreover any proper section of N' is 

generated by an a e AT', hence C a, hence a set. Therefore 7.7 gives the 
result. The isomorphism from On to N’ is denoted by N, i.e., 

*8.57 Dfn (On, AT')- 

It follows: 

*8.58 N‘0 = o; 

since w e N by 8.461. N 7 and w 7 are defined by: 


*8.59 Dfn N 7 = w 7 = N‘ 7 . 
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*8.62 = H a 

Proof: Assuming 7 to be the smallest ordinal number for which H 2 N 7 , we 
prove u) 2 ~ uj 1 . To this end, owing to the Schroeder Bernstein Theorem, it 
is sufficient to show P u (w 2 ) C u> 7 , i.e., P l (a,/3) < w 7 for a,/3 < w 7 , where 
P is the function defined by 7.9. Since, for every 6 , 6 < w 7 . = . 6 < cu 7 , 
it is sufficient to show: P l (a,/3) < w 7 for a, /? < w 7 . Now P L (a,/3) is the 
power of the set of ordinals < P‘(a, /?). This set by definition of P (7.9) 
is mapped by P on the set m of pairs preceding (a, /?) in the ordering R. 
Hence P l (a,f3) = W, but (as seen in the proof of 7.811) m C (/z + l) 2 , 
where fj, — 9Ttctc{a, /?}. 

| Now we distinguish two cases: _ 

1. (i is finite: then (/ j. 4- l) 2 < to by 8.491. Hence m < (ft + l) 2 < u < 
w 7 in this case. 

2 . n is infinite: then, since pt < cu 7 by assumption, Ji = wg for some 
6 < 7 . Hence fi 2 = JI by the inductive assumption. Hence (using *8.3) 
m < (pi 4- l ) 2 < (m 2 ) 2 = Ji < w 7 also in this case. 

It results that: 

*8.621 For any infinite set x, x 2 = x, 
and therefore 

*8.63 3nf (x). y ± 0 : D: xv.y = x + y = 9Haj: (x, y). 

Furthermore 

*8.64 If for any y em, F l y < a, then 6 ( F“m) < axm. 

The proofs of these results on cardinals are not included, since they do not 
differ from the usual proofs. 

8.7 Dfn A is closed with respect to R if R“A C A. 

8.71 Dfn A is closed with respect to S as a triadic relation if 
S“(A 2 ) C A. 

8.72 Y is called closure of X with respect to Ri,...,Rk and with 
respect to Si ,..., Sj as triadic relations if Y is the smallest class 
including X which is closed with respect to the R’s and closed 
with respect to the S’s as triadic relations. 

The existence of this class will be needed only under the following condi¬ 
tions: 
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*8.73 If 9JI (X) and if the R' s and S' s are single-valued, then the closure 
Y exists and is a set, and if in addition X is infinite then Y = X. 

Proof: Define Gf$nV as follows: 

G'x — x + Ri x -1- R£x + S{ 1 (x 2 ) H-h Sj(x 2 ). 

The right-hand side is normal and by 5.1, 5.13, 5.18 is a set for any set x; 
hence G exists by M5. Now define ffinw by 8.45 as follows: 

/‘ 0 = x, f\k + 1) = G l f l (k). 

Now consider 6 (/“u>); this is a set, and satisfies the requirements of Def¬ 
inition 8.72. Now for any infinite set y we have G l y — y by 8.31, 8.621, 
8.63. Therefore, if x is infinite, f‘n — 0 = x, by complete induction on 

n. Hence 

S(/“w) < x x oj = 9Jtap(x, w) = x by 8.64, 8.63 and 
©(/“w) > = f by 8.28. 


35 | Chapter V 

The model A 

The classes and sets of the model A will form a certain subfamily of 
the classes and sets of our original system S, and the e-relation of the 
model A will be the original e-relation confined to the classes and sets 
of A. We call the classes and sets of A constructible, and denote the 
notion of constructible class by £ and the class of constructible sets by L. 
Constructible sets are those which can be obtained by iterated application 
of the operations given by Axioms A4, Bl-8, modified so that they yield sets 
if applied to sets. In addition, at certain stages of this generating process 
the set of all previously obtained sets will be added as a new constructible 
set. This permits the generating process to continue into the transfinite. 
The above mentioned axioms lead to the following eight binary operations 
3l, • • • )#8 called fundamental operations: 

9.1 Dfn $i{X,Y) = {X,F}, 

&{X,Y) = E-X, 

$3 (X,Y) = X-Y, 

(i.e., = x ■ (v x y)), 
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8 6 {X,Y) = X-Q{Y), 

UX,Y) = X-Y~\ 

$t(X, Y) = X ■ Cnt> 2 (Y), 

$ s (X,Y) = X-€m 3 (y). 

The factor X in $2, ■ ■ ■, $8 is added for reasons that will appear later (The¬ 
orem 9.5). The operation of intersection (given by Axiom B2) is left out 
because X Y = X — (X — y). Owing to 4.92-4.96, # 4 ,... ,5s can be 
expressed differently as follows: 

9.11 3 4 (X,Y) = X-P 2 Y, 

$ 5 (X,Y) = X-P£Y, 

3 6 (X,Y) = X-PSY, 

&{X,Y) = X-P£Y, 
ds(X,Y) = X ■ P5Y. 

In other words, 

9.12 $i(X,Y) = X Q?Y, * = 4,...,8, 
where the Qi are defined by 

| 9.14 Qi -- P 2 , Q5 — P 2 , Qg = P 3, Qi = Pi, Qs = Ph- 36 

By means of Theorem 5.11 it is seen that all the fundamental operations 
give sets when applied to sets. 

Now consider the class 9 x On 2 (i.e., the class of triples {i, a, /?}, i < 9) 
and define the following well-ordering relation S for it: 

9.2 Dfn pL,v < 9.D.: (ft,a,(3)S{v,'y,8). = : 

(a,f3)R('y,6) .V.((a,/3> = ( 7 ,<5) ./x < v) :: 5 C (9 x On 2 ) 2 , 

where R is the relation defined by 7.81. Concerning the existence of S, cf. 
Definition 7.8. Since 

(i,a, / 3)S(j, , 7 ,£) .O: (a ,/ J)fl< 7 , S) .V. (a, 0) = ( 7 ,S), 

it follows from 7.811 and 5.18 that any proper 5-section of 9 x On 2 is a set. 

But 9 x On 2 is not a set by 5.43. Hence 9 x On 2 is isomorphic to On with 
respect to 5 and E by 7.7, i.e., there exists a J satisfying the following 
defining postulate: 

9.21 Dfn J3ti(9 x On 2 ) .W(J) = On : /x, «/ < 9. D 

■ [(/x, a, 0) S{v, 7 , 6 ). D. J‘ (/x, a, 0) < J‘ (v, 7 , £)]. 
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Now we define nine functions Jo,..., J» over On 2 by: 

9.22 Dfn Jo (a,/3) = J‘(0,a,/?), J 0 $nOn 2 , 

Js (a, 0) = J‘( 8 , a, 0), J&nOn 2 . 

Evidently we have: 

9.23 The 2U (./*), i = 0,..., 8 , are mutually exclusive and their sum is 
On. (It is easily seen, but not used in the sequel, that the 2U ( J t ) 
are the congruence classes of On mod.9 and that Jl{a,0) = 
9 x P‘{a,f3) + i, where 4- and x denote arithmetic addition 
and multiplication of ordinals.) 

By definition of J there exists for any 7 a unique triple (i, a, (3) such 
that 7 = J‘(j,a,/3). Hence there are two functions K\, K 2 over On such 
that: K{Ji{a,f3) = a, K 2 Ji (a, f3) = [3, for any i < 9. Ki,K 2 are defined 
by: 

9.24 Dfn (a, 7 ) e K\ = (3jU,/?)[// < 9.7 = J‘(fj,,a,(3)] .K\ C On 2 , 

(13, l)eK 2 = (3n, cc)\jx < 9. 7 = J l (n, a, 0)]. K 2 C On 2 . 

For the J< and Ki we have the following theorems: 

9.25 Ji(a,f3) > %Jlat{a,(3}, 

Ji (a,f3) > ?Olai{a,(3} for i ± 0, 

K[ a <a, K 2 a < a, 

37 | K{a<a, K 2 a < a for a^2D(Jo). 

Proof: Set 9Jlay{a,/3} = 7 ; then we have Jo { a,/3) > Jo ( 7 , 0 ) by Definition 
9.21; Jo ( 7 , 0 ) > 7 by 7.611; Ji(a,f3) > Jo (a,/3 ) for i ^ 0 by Definition 
9.21. Writing the last three inequalities as one inequality, we obtain (for 
* ^ 0): 

Ji(a,/3)> Jt(a,P)> Jo (7,0) > 7 , 

which gives the first two statements of 9.25. The last two express the same 
facts in terms of K\ and K 2 . 

*9.26 a, (3 < D Ji {a, j3) < 

Proof: By Definition 9.21 J maps the set m of triples preceding (i, a, (3) in 
the ordering S on the set of ordinals < Jj(a,/3). Hence J‘ (a, (3) ~ m. But 
m C 9 x (SUtap{«, (3) + l ) 2 by 9.2 and 7.81. Hence the theorem by 8.491 or 
8.63 according as 7 = 0 or 7 > 0 (using 8.492 in the first case). Note that 
the axiom of choice is not used in the case 7 = 0 . 
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*9.27 w a e20(J o ) 

Proof: oj a < J‘(0,w Q ,0) by 9.25; but not u> a < J‘(0,w a ,0), because this 
would imply iv a = J‘(i, 7 ,6) for some triple (i,-y,S) preceding (0, oj a ,Q) in 
the ordering S. But {i,j, 6)S{0,uj a ,0) implies 7 , 6 < u /<*, hence 

by 9.26. Hence = Jo {w a , 0), i.e., w a e 20(J O ). For a = 0 the axiom of 
choice is not used in this argument. 

Now we define by transfinite induction a function F (the letter F is to 
be used only as a constant from now on. A similar remark applies to J?, 
S, C defined respectively by 7.81, 9.2, 11.81) over On by the following 
postulates: 

9.3 Dfn aeW(Jo) DF‘a = W(Fta), 

a e 20 (Ji) D F‘a = & (F‘Ki a, F l Ki a), 


a e 20 (J 8 ) D F l a = $ 8 (F l Ki a, F‘K± a), 

F On. 

In order to prove the existence of F by 7.5, it is necessary to define 
first a function G over V by the following postulates: If S (x) e 20 (Jo), 
G‘x = 20 (x); if 2) (x) e 20 (Ji), * = 1,2,..., 8, 

G l x = &[x7£i‘2>(*), x‘A 2 ‘2)(x)]; 

and G‘x — 0 everywhere else. Since all symbols occurring are normal (cf. 
page 62), G exists by M6. By 7.5 there exists an F over On satisfying the 
equation F‘a — G l (Ff a), which implies that F satisfies 9.3, as is seen by 
the following proof: Suppose a e 20(Jj), i / 0. Then, since D(F t a) = a, 
2)(F! a) t 20 (Ji). Therefore 

G‘(F I a) = &[(F I a) l K[ a, (F f a)‘F 2 a] 

K{a < a and Kia < a, by 9.25, and (FTa)‘/? = F‘/3 if (3 < a, | there¬ 
fore F‘a = G l (F l a) = &[F‘Fi‘a,F‘F 2 ‘ a]. Similarly, if a e 20 (J 0 ), then 
2) (F T a) e 20 (Jo), so that 

F‘a = G‘(F t a) = W(F f a). 

Hence F exists, and by induction it is seen that F is uniquely deter¬ 
mined. The following results are consequences of 9.3 obtained by sub¬ 
stituting for a in the line of 9.3 and applying the equations: 

K{ Jj(a,/3) = a, Jf 2 Ji(a,(3) = ft, which hold by definition 9.24. 

9.31 F‘Ji(/3,7) = {F‘/3,F‘ 7 } 
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9.32 F‘Jl(p,'y)=E-F t P 

9.33 F‘Jg(/3,7) = F‘j9-F‘7 

9.34 F‘(/?, 7 } = F‘/3 • Q“(F‘y), i = 4,5 ,...,8 

9.35 a e 233 (Jo). D. F‘a = F“a 

The last set of theorems show how F reflects the nine fundamental op¬ 
erations of 9.1. 

A set x is said to be constructible if there exists an a such that x — F‘a. 
The class of constructible sets is denoted by L, i.e., 

9.4 Dfn L = 233(F). 

A class A is constructible if all its elements are constructible sets and if 
the intersection of A with any constructible set is also a constructible set, 
i.e., 

9.41 Dfn £,{A). = .:ACL : xeL.D.A-xeL. 

Dfn x,...,z will be used as variables for constructible sets and 
X ,..., Z as variables for constructible classes. 

9.42 Dfn The smallest a such that x = F‘a is called the order of x 
and is denoted by Od‘x, i.e., 

9.421 Dfn {y,x) e Od = (x,y) e F. (z)[z ey D ~(x, z) e F], Od C V 2 . 

9.5 Comp (F u a) 

It is sufficient to prove: F l a C F“a, i.e., all elements of a constructible set 
appear earlier than the set itself. 

Proof: Let a be the first ordinal for which F l a C F“a is false. If 
a e 233 (J 0 ) then F‘a = F“a, hence F l a C F“a. If a e 233(7*), i / 0, then 
a = Ji{/3, 7 ), i ^ 0. By Theorems 9.32, 9.33, 9.34, if i > 1, F‘a C F l /3. But 
(3 < a, by 9.25, so that the theorem holds for /?, that is, F‘/3 C F“/3. Hence 
F l a C F lt 0. Again, since (3 < a, F“{3 C F“a, therefore F‘o: C F“a. 
If i = 1, by 9.31 F l a = {F‘f3,F‘ 7 }, where a = Jj(f3,i). By 9.25, 0, 
7 < a. Therefore F‘/3 e F“a and F‘ 7 e F“a, hence {F l f3,F l 7 } C F“a, 
i.e., F‘a C F u a. 

9.51 Comp ( L ), i.e., any element of a constructible set is constructible. 
(For constructible classes the same thing is true by Definition 
9.41.) 

39 | Proof: Take x e L and let a = Od‘x, so that F‘a = x. Then by 9.5, 

x C F “a. Hence x C L, since F“a C L. 

The following statement follows from 9.5: 
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9.52 If x e y, and x, y e L, then Od‘x < Od‘y. In other words 
x e F‘a. D. Od‘x < a. 

3 i,... ,#8 yield constructive sets if applied to constructible sets, i.e., 

9.6 &(x,y)eL, i— 1,... ,8. 

Proof: There exist /3 ,7 such that x = F l /3, y = F‘ 7 ; 9.31 to 9.34 give the 
result. 

9.61 x-yeL 

Proof: x ■ y = x — (x — y)\ then 9.6 for * = 3 gives the theorem. 

9.611 Od‘x < co a . Od l y < co a :D.Od‘(x ■y)<co a 
Proof by 9.26. 

9.62 x,y e L , = .(x,y) e L and x,y,z e L. = .{x,y,z) e L. 

Proof: The implication in one direction results from expressing (a:, y) as 
{{ 2 }, {£,y}}, then applying 9.6; and the reverse implication is a conse¬ 
quence of 9.51. 

9.621 (x,y) e L. = .{y,x) e L 

(x,y,z) e L. = .(z,x,y) e L. = .{x,z,y) e L 

(follow immediately from 9.62) 

9.623 QiX e L for * = 5, 6 ,... ,8 

(follows from 9.62, 9.621) 

9.63 iCi.D. (3y)[x C |7] 

Proof: Consider Od“x, which is a set of ordinals; by 7.451 there exists an 
ordinal a greater than every element of Od“x, i.e., such that Od u x C a. 
Moreover, such an a can be found with the additional restriction that 
a e W (J 0 ) (e.g., by taking Jq ( 0 , a) instead of a, since Jo ( 0 , a) > a by 
9.25), hence F‘a = F“a by 9.35; but x C F“a, hence x C F l a, and F‘a is 
a constructible set. It follows that a constructible class which is a set is a 
constructible set, i.e.: 


| 9.64 m(X).D.X e L. 
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Proof: By 9.41 and 9.63, X is contained in some y. Therefore X ■ y = X, 
but X -y is a constructible set by 9.41. 

9.65 £(x) 

Proof: By 9.51, x C L; by 9.61, x ■ y e L for any y. 

9.66 x + y e L 

Proof: There is a z such that x 4- y C z, by 9.51 and 9.63. x + y = 
z — [(z — x) — y]. Hence 9.6 gives the theorem. 

9.8 Oel 

Proof: 0 = x — x, hence constructible, by 9.6. 

9.81 £(L) 

Proof: 1C1; and because of 9.51, x ■ L — x, hence x • L e L. Therefore 
£(L) by 9.41. 

9.82 Z(E-L) 

Proof: E ■ L C L; also x ■ E e L by 9.6, since X ■ E is a fundamental 
operation; but x ■ E = x ■ E ■ L because x C L\ hence x • E ■ L e L, and so 
by 9.41, £ (E ■ L). 

9.83 £(A-B) 

Proof: A — B C L; moreover x • A — x ■ B is constructible, by 9.41 and 9.6; 
but x ■ A -x ■ B = x ■ (A - B); hence x • (A - B) e L, so that £ (^4 - B), 
by 9.41. 

Similarly: 

9.84 2(1-B), 
and 

9.85 £(A + B). 

9.86 £(QiA), i = 5,..., 8 

and 


9.87 


£{L ■ Q4A). 
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The last two theorems are proved as follows: Q 5 , ...,Q g take constructible 
sets into constructible sets, by 9.623; therefore | QfA C L, i = 5,..., 8. 
In order to prove that x. QiA e L for i — 4,..., 8, consider an arbitrary 
y ex ■ Ql‘A, i — 4,... ,8. y is an image by Qi of some element of A; take 
the element y' of A of lowest order of which y is an image. The totality 
of these y ' for all elements y of x • QiA is a set u of constructible sets and 
u C A. By 9.63 we have u C z, for some z. z can be determined so that 
z C A, merely by taking z ■ A. Hence we can assume u Cz C A. Therefore 
x ■ Q“z Cx ■ QiA by 4.86; but also x ■ QiA C x • Ql'z because any element 
of x ■ Qi A has an original in u, hence in z. Hence x ■ Qi A = x ■ Q?z, but 
(x ■ Qi‘z) e L by 9.6. 

By means of Theorems 4.92 to 4.96, Theorems 9.86 and 9.87 take the 
following three forms: 

9.871 £[£>(1)]^ 

9.872 £ [€m k (A)]_ for k = 1,2,3, 

9.873 £[L-(VxA)]. 

9.88 £(Axl) 

Proof: By 4.871, A xl3 = (V xB) ■ (A x V) = L ■ {V xB) ■ L ■ (Ax V), 
because^ A x B C L, by 9.62. By_9.8_73 and 9.872, £[L • (V x B)] and 
£ [L ■ (A x V)\. Hence, by 9.84 £(AxB). 

9.89 £[2U(I)] 

Proof: 20 (A) = D (A), hence the result follows from 9.871 and 9.872. 

9.90 £[ZrB] 

Proof: A^ B = A-(V x B) = A- L- (V x B), hence the theorem, by 9.873 
and 9.84. 

9.91 £f4“H] 

Proof: A“B = W (A? B), hence the theorem, by 9.89 and 9.90. 

9.92 £({Z,y}) 

Proof: By Definition 3.1 {X, T} is either 0 or {X} or {Y} or {X, T}, 
where now only sets can appear within the braces. Hence the theorem, by 
9.6, 9.65, 9.8. 

Not all operations on constructible classes give necessarily constructible 
classes. For example, it cannot be shown that £pP(X)]. 
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Now consider the model A obtained as follows: 

1. Class is construed as constructible class. 

2 . Set is construed as constructible set. 

42 3. e ( , the membership relation, is to be the e-relation | confined to con¬ 

structible classes, i.e, X Y , = . X e Y. 

The operations, notions and special classes defined so far can be rela¬ 
tivized for this model A byreplacing in their definition or defining postulate 
the variables X.Y.... by X, Y, ...; the variables x, y, ... by x, y,. ..; e by 
e ; ; and the previously defined concepts and variables by the corresponding 
relativized ones, leaving the logical symbols (in particular also =, which 
is considered as a logical concept) as they stand. The relativized of a 
variable y is a variable whose range is obtained by relativizing the notion 
which defines the range of y. Note that for an operation or special class the 
relativized need not exist a priori, because the theorem which states exis¬ 
tence and unicity (cf. page 12) may not hold in the model A; furthermore 
the relativized concept may depend on the particular definition which we 
chose, since equivalent definitions need not be equivalent in A. (However, 
as soon as we have proved that the axioms of E hold for A, we know that 
the relativized always does exist and does not depend on the particular 
definition.) If the relativized of a defined class A, operation 21, notion *8, 
variable y exists (which presupposes that also the relativized of any symbol 
occurring in its definition exists), we denote it by A[, 21;, 23;, y ; (hence :r / , 
Xi have the same range as x, X). 21; and 03; are defined for constructible 
classes as arguments only, and we have the theorem: 

10.1 If Ai and 21; exist, then Ai is constructible and 21 i(Xi ,..., X n ) 
is constructible for any Xi,..., X n . 

Evidently the relativized classes, notions and operations are at the same 
time classes, notions and operations of the system E, if the requirement 
on page 11, that they be defined for any classes as arguments, is met, e.g., 
by stipulating that 21; (Xi,..., X n ) — 0 and 03; (Xi,..., X n ) is false, if 
Xi,..., X n are not all constructible. 

10. Dfn A special class A or operation 21 or notion 03 is called 
absolute if A.;, 21;, or 03r exists^ respectively,_and A±_ = A, 
21; (Xi,.. -, Xn ) = 2l(Xi,...,X„), or 03, (X_i,..., X „), = 

. 03 (Xi,..., X n ), respectively, for any Xi,...,X n . A variable 
y is called absolute if the range of y ( is the same as the range 

of y- 
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By Theorem 10.1 we have: 

10.11 If A (the operation 21) is absolute, then A is constructible 
(21 (Xi,..., X„) is constructible for any X ±,..., X n ). 

Concerning the meaning and purpose of the metamathematical notions 
of relativization and absoluteness, cf. page 1. The relativized of a propo¬ 
sitional function (p or a proposition ip | is denoted by <p r ip { , respectively, 
and obtained by replacing any concept and variable occurring in it by the 
relativized one (presupposing that they all exist). In particular also the 
relativized of a theorem is quoted by putting a subscript l to its number. 

10.12 e is absolute. 

This is true by definition of e { . 

10.13 “C” is absolute. 

Proof: X C, Y. = .(u)\ue l X. D.«c,y]. = . (u)[u e X. D .u e ¥]. Also 
X CY (u)[w e X .D.u e Y], If (u)[w e X .D.ue F], then in particular 
(u)\u e X. D .u e Y]. On the other hand, the reverse implication holds, 
sino^ if u is not in L, the condition holds vacuously, because the hypothesis 
ue X is false. Therefore X Cj Y. =. X C Y. 

10.131 X C; Y .Y Q X :D .X — Y, i.e., the relativized axiom of 
extensionality holds. 

Proof by 10.13 and the axiom of extensionality. 

10.14 “c” is absolute. 

Proof: IC|F.E:Ic,F.I/F=.IcF,by 10.13. 

Similarly: 

10.15 fiy is absolute, i.e., (A'jF) . = .<£p(X,Y). 

10.16 <&n is absolute, i.e., Cm* (X) .s. <£nx(X). 

10.17 The operation {X,y} is absolute. 

Proof: By 3.1 (X, Y }; is the constructible class Z such that 
(u)[ueZ. = :u = X.W.u = Y]. 

{X,Y} satisfies this condition on Z because it satisfies it even with (u) 
instead of ( u ). Moreover, {X, Y} is constructible, by 9.92. Also {X,Y} is 
the only constructible class satisfying the condition (by 10.131). Hence the 
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relativized of the operation {X, F} exists and {X,Y}i = {X. Y} for any 
X, F, i.e., {X, Y} is absolute. 

10.18 If <£ is defined by € (X) =21(05 (X)) and 21 and 05 are absolute, 
then (£ is absolute. 

Proof: 21(05 (X)) = 2t_(05/ (X)),_but 05; (X) is constructible by 10.1, hence 
21(05; (X)) = 21; (05; (X)) = £; (X). 

This principle holds also for operations with more than one argument. 

44 | 10.19 The operation (X, Y) is absolute. 

This is an immediate consequence of 10.17 and 10.18. Similarly: 

10.20 The operation (X,Y,Z) is absolute. 

10.21 iln is absolute. 

Proof: Urii (X). =. (u, v, uJ)[(U, «); e ; X. (w, u)i X : D. v = «J]. 

By 10.12 and 10.19 the subscript l can be dropped wherever it appears on 
the right. The condition is now equivalent to that obtained by replacing 
u,v,W , by u,v,w, respectively, as in the proof of 10.13 (using 9.62). 

10.22 9JI is absolute and is absolute. 

Proof: Tli (A). =. X e L, by definition of the model A on page 41, therefore 
9J1; (X). = .911 (X), by 9.64 and Axiom A2. Hence also ~9Jt; (X). = . 
~9H(X). 

Not all concepts can be proved to be absolute; for example, and V 
cannot be proved to be absolute. 

10.23 Vi=L 

Proof: Vi is defined by the postulate (x)[x e V{\. L satisfies the condi¬ 
tion, hence L = Vp because of the relativized axiom of extensionality and 
because Z(L) by 9.81. 

10.24 0 is absolute. 

Proof: (^)[~* e 0], and 0 is the only constructible class satisfying this 
postulate. 
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| Chapter VI 45 

Proof of the axioms of groups A-D for the model A 

Every notion and operation appearing in the axioms has now been shown 
to be absolute. This facilitates the proofs of the relativized axioms, since in 
forming the relativized of a proposition all absolute notions and operations 
can be left as they stand, because by 10.1 only constructible classes can 
appear as their arguments, so that the relativized axioms may be formed 
merely by replacing X by X and x by x. For convenience we list the axioms 
in their relativized form: 

Ali £(% 

2 1 XeV.D.m(X), 

3 1 (u)[ueX. = .ueY].D.X = Y, 

4 1 ( x,jj){3z)(u)\uez : = :u = y.V.u = x ]; 

Bl ( (BA){x,y)[(x,y) e A. = .x ey\ 

2; (A,B)(3C)(x)\x e C . = :x e A.x e B], 

3/ (A)(3I?)(x)[x e I?. = .~x e A], 

4/ (X)(3B)(x)[x eB. = . (3f7)[(l/,x)_e A]], 

5 ( (A){3B){x,y)[(y,x) eB. = .xe A], _ 

6 1 (A)(3B)(x,y)[(x,y) e B .=^(y,x) e A], 

7 1 (A)(3B)(x,y,z)[{x,y,z) eB. = , (y,z,x) e A], 

8 , (A)(3B)(x, y, z)[(x, y,z)eB=.(x, z , y) e A]; 

Cli (3a){~<&n(a). (x)[a: ea.D. (3 y){y ea.iC j/)]}, 

2 1 (x)(3y)(u,v)[uev.v ex :D.uey], 

3j (®)(3 y)(u)[uCx.D.uey], 

4 1 (x,A)^n(A) .D.(3y)Jji)\uey^ = .(3v)(v ex ,{u,v) e A)]}; 

D; (A). D. (3x)\x e A. <2y (3:, A)]. 

Alz is Theorem 9.65, A2/ is immediate from A2, A3; holds by 10.131, 

A4/ is satisfied for z = {x, y}, which is constructible by 9.6. Now we 
prove Bl-8; by exhibiting in each case a constructible class satisfying the 
conditions, as follows: 

Bl; Take A = E ■ L. The class E L is constructible by 9.82 and 
satisfies (x, y) e E ■ L . =. x ey, because (x,y) e E . =. x ey and {x, y) e L. 

2/ Take C = A ■ B. This class is constructible by 9.84 and satisfies 
B2. 

3( Take B = L — A. This class is constructible by 9.83, 9.81 and 
satisfies B3. 

4 1 Take B = J) (A). By 9.871 3) (A) is constructible. 
x e B . =. (3 y)[{y, x) e A]. Therefore, in particular, 

| x e B. =. (3 y)[{y, x) e A]. =. (3j7)[(l/, x) e A]. 
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The last equivalence holds, because, if there exists a y it must be con- 
structible by 9.62. 

5; Take B = L - (V x A). B is constructive, by 9.873. 

(x,y) e B . =. {x, y) e L_.y e A; therefore (x, y) e B. =. (x, y) e L. y e A so 
that (x, y) e B . =. y e A^since (x, y) e L, by 9.62. 

6 ; Take_B = £no(A)._B is constructible, by 9.872. 

{x, y) e £no (A). =. {y, x) e A; therefore, in particular, 

(x, y) e £ttt) (A). =. {y, x) e A. 

Axioms B7-8j are proved in the same manner. Now consider Axioms 
Cl-4j: 

1; Cl; is satisfied by a = F l w. 

Proof: ui e 20 (Jo) by 9-27, hence F l u> = F“w. If x e a, (i.e., x = F l a, 
a < w), let /3 be an integer e 20(J 0 ) and > a (e.g., /3 = J 0 ‘ (0, a + 1) by 
9.25 and 9.26) and put y = F‘/3; then yea and y D x because F‘f3 = F“/3 
and F‘a C F u /3. 

Moreover: F‘a e F“f3 but ~(F‘a e F‘a) so that F‘a C F l f3. 

21 Consider @(x); this is a set of constructible sets by 5.122 and 

9.51. Therefore, by 9.63, there is a y such that 6 (x) C y. Hence 

(u,v)[u e v .v ex :D.u ey\, therefore 

(u,v)[uev.v (x:D.uey\, 

that is, y satisfies the condition of C2;. 

3; Consider L ■ ty(x) (which is a set by 5.121) and take y such 

that L ■ (x) C y, by 9.63. Then u e L. $ (x). D . u e y. Therefore 

u e L ■ ip (x). D. u e y, so that u e (x). D. u e y, that is ,uCx.D.uey. 

4; Take y = A“x. y is constructible, by 9.91. 

u e y . = . (3u)[u ex.{u,v)e A ], 

therefore, in particular, 

uey. = .(3v)[vex.(u,v) e A]. 

Now, if there is a constructible v, there is a v satisfying the condition; on 
the other hand, if there is a v, v will be constructible, since vex. Therefore 

uey. = .(3v)[vex.{u,v)e A). 

Finally, consider Axiom D;. By Axiom D, (3ar)[a: e A . (x, A)]. But x 

is constructible, since x e A. Hence there is an x satisfying the condition. 
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Since all axioms of E hold in A, it follows now that all theorems proved 
so far also hold in the model A, except perhaps those based on the axiom 
of choice. Therefore the existence and unicity theorems necessary for the 
definition of the special classes and the operations introduced so far also 
will hold in A, and, as a result, the relativized of every concept introduced 
so far exists (except those definitions marked by *, which depend on the 
axiom of choice); in particular also £; and Li exist. 


| Chapter VII 

Proof that V = L holds in the model A 

In order to prove that the axiom of choice and the generalized continuum 
hypothesis hold for the model A, we shall show: (1) that both of them 
follow from the axioms of E and the additional axiom V = L (which says 
that every set is constructible) and (2) that V = L holds in the model A, 
i.e., Vi = Li. We begin with item (2). Since Vi = L by 10.23, it is sufficient 
to prove Li — L, that is, the class of constructible sets is absolute. To that 
end, it will be shown that all operations, etc. used in the construction of L 
are absolute. 

A general remark for proofs of absoluteness will be useful. In order for 
the operation 21 (Xi,..., X n ) to be absolute it is sufficient to show that 

(1) 2t gives constructible classes when applied to constructible classes, 
and 

(2) 21 satisfies the relativized defining postulate, i.e., if 21 is defined by 
</>(2l(Xi,..., X n ),X u .. .,X n ), then <hQL(X lt ... ,X n ),X u ... ,X n ). 

It is easily verified that (1) and (2) are sufficient, namely, as follows: 
21; exists, since the model satisfies the axioms of E. Hence fa has the 
property that for any X\,... ,X n there exists at most one Y such that 
fa(Y,X r,..., X„). 

But 

0/(21 i(X 1 ,...,X n ),X 1 ,...,X n ) 
by definition of 21; and 

fa{^(X 1 ,...,X n ),X 1 ,...,X n ) 
by assumption (2). Therefore 


21, (X u ..., X n ) = 21 (Jfi,..., X n ). 

Similarly for the particular class A it is sufficient to show that it is con¬ 
structible and satisfies the relativized postulate. Remember also that by 
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10.18 operations defined by substituting absolute operations into absolute 
operations are absolute. 

11.1 “x” is absolute. 

Proof: A x B is constructible, by 9.88. 

ueAxB=.(3v,w)[veA .w e B .u — (v , iu)] by Definition 4.1. Therefore 
u e A x B . = . (3?;, w)[v e A. w e B. u = {v ,«/)]. Now, in the usual manner, 
the condition on the right is equivalent to that obtained by replacing v, w 
by v,w respectively. Therefore A x B satisfies the relativized postulate; 
hence “x” is absolute, by the remark made above. 

11.11 The operations A 2 , A 3 ,... are absolute. 

48 | This follows from 10.18 and 11.1. 

11.12 Die! and 9M 3 are absolute. 

Proof: DM(Z). = .X C V 2 and SHelj (3f) .1 C tf, by 10.23; but 

X C L 2 . = . X C y 2 , by 9.62. Hence 9Wj (X). = . 9Tet (JsT). Similarly for 
93ei 3 . 

11.13 ® is absolute. 

Proof: D (A) is constructible, by 9.871. x e D (A). = .(3 y)[(y,x) e A], 
therefore x e S (A). =. (3y)[{y, x) e A]. In the usual way, the last condition 
is equivalent to that obtained by replacing y by y, so that D (A) satisfies 
the relativized postulate. 

11.14 is absolute. 

Proof: A - B is constructible, by 9.84. x e A ■ B .= : x e A. x e B; therefore 
x e A ■ B , = :x e A.x e B, that is, A • B satisfies the relativized postulate. 

11.15 CnOfe is absolute (k = 1,2,3). 

Proof: £m k (A) is constructible, by 9.872. Consider, e.g., CnDi (A). It 
satisfies the condition 

fHel(£nD 1 (A)). (x, y)[(x, y) e £ra>i(A) . =. {y, x) e A] 

by definition. This condition implies the relativized statement by 11.12. 
Similarly for CnOfc (A). 
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11.16 “ 1 ” is absolute. 

Proof: A 1 B = A-(V x|) and A t = A- ( LxB ). But A • (V xB ) CLxL 
by 9.62; therefore At B_= A - (V x B) ■ (L x L) = A ■ (L x B), by 4.87. 
Therefore A 1 B — A 1 iB. 

11.17 “233” is absolute. 

Proof: 2P (A) = D (<£nt) (A)) by definition. Hence the theorem by 10.18, 
11.13, and 11.15. 

11.18 The operation A U B is absolute. 

Proof: A“B = 233 (A 1 B), by definition. Hence the theorem by 10.18. 

11.181 The relativized operation of the complement is L — X. 

Proof: L — X is constructible by 9.81, 9.83 and yeL — X. = .~yeX. 

11.19 The operation A — B is absolute. 

| Proof: A-iB = A-(L-B) = A L-(-B) = A-(-B) = A-B. 

11.20 “+” is absolute. 

Proofi A + t B = L - [(L - A) ■ (L - B)] = L - [L - (A + S)] = A + B (since 
A + BCL). 

11.21 Ei = E ■ L 

Proof: E ■ L is constructible by 9.82. Also 

m ( EL ). (x, y)[<®, y) eEL. = .xey], 

since SHel (E ■ L) and since (x, y) e L and (x,y) t E . = .x e y. Therefore 
E ■ L satisfies the relativized postulate. 

11.22 £2 is absolute. 

Proof: fo, (A,^) = X -t E t = X ■ L ■ E = X ■ E - (X,Y), by 11.14, 

11.21. 

11.221 All the fundamental operations & (i — 1,2,..., 8) are absolute. 




84 


Go del 1940 


The proof follows from 10.17, 11.22, 11.19, 11.16, 11.13, 11.15, respectively, 
using 10.18 and 11.14. 

11.23 The binary operation A‘X is absolute. 

Proof: Since any y satisfying (y,X) e A is constructible by 9.51, we have: 
if there is exactly one constructible set y such that (y,X) e A, there is 
exactly one set, and vice versa. Therefore ( A‘)iX = A‘X in this case; in 
the contrary case both are 0. 

11.3 Comp is absolute. 

Proof: 

Comp(X). = .(u)[ueX.D.uCX|. 

= .(u)[ueX.D.uC X]. = .Comp, (X). 

11.31 £)rP is absolute. 

Proof: 

DtP (X).ee: Comp (X ). ( u , v)[u, v e X : D 

:.u = v.V.u e v.V.v e u]. 

=: Comp; (X ). (%v)\u,ve X : D 

:.u = v.V.uev.V.veu]. 

=: Dtp; (X). 

The first and last equivalences follow immediately from the definition of 
Dtp and Dtp;. 

11.32 £> is absolute. 

Proof: 

D(X). = :DtO(X).m(X) : = : DiOi (X) .Wh (X). 

= :Oi(X), by 11.31,10.22. 

50 | 11.31 says that the ordinals of the model A are the same as the ordinals 

which belong to the model A. This does not mean that the ordinals of the 
model are the same as the ordinals of the original system, since nothing is 
said of those ordinals which may not belong to the model (i.e., may not be 
constructible). Cf. however 11.42. 

11.4 “^nc” is absolute. 


Proof: 


ync;(y).= :lHe[;(y).lXn;(T). 

= : 5Kel(F) .iln(F) by 11.12,10.21,4.61. 
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11.41 “(?n” is absolute. 

Proof: 

=: ^nc(F). T>(Y) = X by 11.4,11.13,4.63. 

11.42 On is absolute. 

Proof: Dt®/ ( Oni ) by 7.16/ and ^Jt/(On/) by 7.17/. But On/ is constructible 
by 10.1, hence Drh (On,/) and (On/) because Dtt) and are absolute by 
11.31 and 10.22. Hence On/ = On by 7.2. 

By 10.11, it follows from 11.42 that On C L; in other words, every 
ordinal number is constructible. Furthermore 11.42 implies: 

11.421 The variables a, /?,... are absolute. 

11.43 “<” is absolute. 

Proof: “<” is by definition the same as “e”. 

11.44 “<” is absolute. 

Proof: X < Y is by definition X e Y .V. X — Y. 

11.45 “ + 1” is absolute. 

Proof: 7.4, 10.17, 11.20 and 10.18. 

11.451 Each of the symbols 0,1,2,3,..., etc. is absolute. 

Proof by 10.24 and 11.45. 

11.46 6 (and therefore OJfaj: and £im) is absolute. 

Proof: 

ze6(I) =.(3«)[^«.!/eI] =.(3iJ)[z f )J.iiel] =.^6/ (X). 

Therefore & (X) = 6/ (X) by the axiom of extensionality. 

| What is left now is to show that the special classes R, S, J, K\, K 2 , 
F, and finally L , are absolute, where R is the ordering for pairs defined in 
7.81, S is the ordering of the triples (f, o, fj) defined by 9.2, and F is the 
function introduced by 9.3 which defines L. For each of these the proof of 
absoluteness will be based on the following lemma: 
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If the class A is defined by the postulate <p{A) and if all defined classes, 
operations, notions, and variables appearing in <j) are absolute, then A is 
absolute. 

Proof: If <j) satisfies the condition above, then . = .<fi(X). Also 4>i{Ai) 
and 4>{A) by definition of At and A. Since by 10.1 Ai is constructible, 
4>i{Ai) implies <j>(Ai), hence At = A, because both <f>(Ai) and <j){A). 

11.5 “ R ” is absolute. 

Proof: By Definition 7.81 we have 

R C (On 2 ) 2 . (a, 0, 7 , «)[«a, 0), ( 7 , 6 )) e R. at 

::.9Hap{a,^} < 9 Hctj:{ 7 , £} .V :: fUtap{a, 0} = { 7 , <5} :. 

0 < 6 ,V : /3 — 6 .a < 7 ]. 

The following concepts appear in the defining postulate: C, On, 2 , (), QJlay, 
{ }, <, e, and variables a, /?,..., all of which have been proved absolute by 
10.13, 11.42, 11.11, 10.19, 11.46, 10.17, 11.43, 10.12, 11.421, respectively. 

11.51 “5” is absolute. 

Proof: By Definition 9.2 we have 

S C (9 x On 2 ) 2 .(a,P, 7 , 6 ,n,v){n <9.u < 9:D 

:: <( M ,a,/?>, {u, 7 , 6 )) e S. =:. ((a,0), ( 7 , 6 )) eR.V 
: (a, 0 ) = ( 7 v}. 

In the postulate for S the following concepts appear, other than those 
appearing previously in the postulate for R: x, R, 9, which are absolute 
by 11.1, 11.5, 11.451, respectively. 

11.52 “J” is absolute. 

Proof: By Definition 9.21 we have 

J#n(9 x On 2 ).W{J) — On. (a, 0, 7 , 6 , fi, n)[fi, v < 9. D 

: (^a,p)S(p, 7 ,6).D.R{fi,a,p) < J‘(i/,7,«)]. 

The only additional symbols in this postulate are: gti, 2U and ‘, all of which 
have been proved absolute by 11.41, 11.17, 11.23, respectively. 

11.53 Each “Jj” is absolute, i = 0, 1 , 2 ,..., 8 . 
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Proof: Jo (a, 13} = J‘(0,a,/?). J 0 3fiOn 2 . Here there are no symbols but 
those mentioned before. Similarly for ..., Jg. 

| 11.54 K\ and K 2 are absolute. 52 

Proof: In the defining postulate 9.24 there are no symbols but those men¬ 
tioned before. 

11.6 “F” is absolute. 

Proof: The only additional symbols appearing in the defining postulate 9.3 
are 1 and 5 i, • • • , 5 s> which are absolute by 11.16, 11 . 221 , respectively. 

11.7 “L” is absolute. 

Proof: L = 20(F), and 20, F are absolute by 11.6, 11.17. 

It has now been demonstrated from the axioms of E that Li = L, hence 
also that Vi = Li, i.e., that the proposition V = L holds in the model A . 
This proves that, if there exists a model for the axioms of groups A, B, C, 

D, there exists also a model for the augmented set of axioms obtained by 
adding as an axiom the proposition V = L, namely, the model consisting 
of the classes and sets “constructible” in the given model for E. Thus if 
the system A, B, C, D is consistent, the augmented system is consistent. 
Another way of putting this argument is as follows: If a contradiction were 
obtained from V = L and the axioms of E (i.e., the axioms of groups A, 

B, C, D), then the same contradiction could be derived also from Vi = Li 
and the relativized axioms A/, Bj, C;, D*. But Vi = Li and A i, B;, C;, D; 
can be proved in E as shown before; hence E would be contradictory, and 
a contradiction in E could actually be constructed if a contradiction from 
E and V = L were given. 


| Chapter VTII 53 

Proof that V = L implies the axiom of choice and 
the generalized continuum hypothesis 

Now it remains only to be shown that the axiom of choice and the 
generalized continuum hypothesis follow from V = L and E. 

For the axiom of choice this is immediate since the relation As defined 
in 11.8, which singles out the element of least order in any non-vacuous 
constructible set, evidently satisfies Axiom E if V = L. 
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11.8 Dfn (y,x) e As . = : y a x ,(z)[Od‘z < Od‘y .D.~z e x\.9\z[(As) 

As‘x is what may be called the “designated” element of x. 

11.81 Dfn C‘a = Od i [As‘{F‘a)].C'SnOn 

C'a is the order of the “designated” element of F‘a. Hence C‘a < a by 
9.52. 

The rest of these lectures is devoted to the derivation of the generalized 
continuum hypothesis from V = L and the axioms of S. Since we have 
just derived the axiom of choice from these assumptions, we are justified in 
using all starred theorems and definitions in this derivation. The theorems 
which follow from now on are only claimed to be consequences of S and 

V = L. However, only 12.2 really depends on V = L\ in all the others 

V = L is not used, and even the axiom of choice could be avoided in their 
proofs, if one wanted to. 

12.1 F 7 ^ = uj a 

Proof: F“w a < ZJ a — w a by 8.31. On the other hand, there exists a subset 
of w a , namely u> a ■ 23 (Jo), such that the values of F over this subset are 
all different. For if 7 ^8 and 7 ,6 e ui a ■ 20(Jo), assume 7 < 8; then 

F ‘7 e F‘S, by 9.3, so that F ‘7 ^ F‘6. But u> a -W(J 0 ) > w a , because 

Joi^a) — w oi • 20 (Jo) by 9.26 and J 0 is one-to-one. Hence F“ui a > ui a . 

By 12.1 the generalized continuum hypothesis follows immediately from 
the following theorem: 

54 I 12.2 <P(FV)cr w a+1 . 

This theorem is proved by means of the following lemma: 

12.3 If m C On and m is closed with respect to C, K\. K 2 and 

with respect to as triadic relations and if G is an 

isomorphism from to to an ordinal o with respect to E, then 
G is also an isomorphism with respect to at/3[F‘a e F‘/3 ], i.e., 
a, p e to D [F l a e F‘(3 . =. e F l G l 0\. 

We show first that 12.3 implies 12.2. 

Proof: Consider u e i0(F“w Q ), that is, u C F“uj a . By V = L there is a 
6 such that u = F‘6; form the closure of the set uj n -I- {<5} with respect 
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to C, K i, K 2 , and with respect to Ji, i = 0,1,... ,8, as triadic relations, 
according to 8.73, and let the closure be denoted by m. Now, by 8.73, 
m is a set and m = w a . Furthermore, m is a set of ordinals; hence m is 
well-ordered by E by 7.161 and is isomorphic to some ordinal number o by 
7.7. Let the isomorphism be denoted by G, so that G“m = o. For brevity, 
let a' denote G‘a. By Lemma 12.3 we have: 

a, /? e m . D: E‘a e F l (3 . =. F‘a' e F‘/3'. 

Now consider S', the image of 8 by G. S' e o, that is, S' < o. Since G 
is one-to-one as an isomorphism, o = m — u a , from which it follows that 
o < w a |j, hence 8 ' < w Q _j_ r Also, for any (3 e m, 

F‘f3eF l 6. = .F‘0' eF l 8’. 

D a C m, by definition and ui a is complete (as an ordinal number). There¬ 
fore u a is an .E-section of m; hence uj a is mapped by G on an E-section of o, 
i.e., by 7.21 on an ordinal number. But, by 7.62, this can be only the iden¬ 
tical mapping of uj a onto itself. Therefore, if /?« w a , then (3 ' = (3. Hence 
F‘f3 e F l 8. =. F l (3 t F‘8', for f3 e u) a ; that is, F l 8 and F l 6' have exactly the 
same elements with F“u> a in common, i.e., F‘8 ■ F“u> a = F‘8 1 ■ F“u> a ; but 
F l 8 C F , by assumption; therefore F l 6 = F‘8' ■ F“oj a . But e 2U (J 0 ) 
by 9.27, therefore, by 9.35, F“w a = F l u a , hence u = F‘8 = F‘8' ■ F‘uj a . 
Therefore by 9.611 Od‘u < <^ a ^ v in other words, u e F“oJ a + v q.e.d. 

In order to prove 12.3, we prove at first the following auxiliary theorem: 

12.4 From the hypothesis of 12.3 (leaving out closure with respect to 
G) it follows that 

(1) G is an isomorphism for the triadic relations Ji(i = 0,..., 8), 
i.e. (if G l a is abbreviated by a'): Ji(a',/3') = [Ji {a,/3)j for a, 

(3 e m, i < 9, and 

(2) o is closed with respect to the triadic relations Ji. 

In outline, the proof runs as follows: By definition of J and the closure 
property of m, J establishes an isomorphism with respect to S and E 
between the class of triples (i,a,/3), i < 9, a,f3 e m, and m. By G this 
isomorphism is carried over to an isomorphism be|tween the set t of triples 55 
( i,a,f3 ), i < 9, a, f3 e o, and o. But J likewise defines an isomorphic 
correspondence between t and some ordinal 7, also with respect to S and 
E; from this it is inferred by 7.62 that 7 — o and that J confined to t 
coincides with the image by G of J confined to 9 x m 2 . But this is what 
the assertion of the theorem says. The detailed proof is as follows: 

Set j = J 1 (9 x m 2 ). Then we have X) (j) = 9x m 2 . Now 2H (j) C m, 
since m is closed with respect to all the But also m C 233(j); for 
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suppose 7 era, then 7 = J l (i, a, P) for some i, a, ft where a, (3 e m, since m 
is closed with respect to Kj and K 2 ; hence 7 6 2D (j). Therefore 2D (j) — m. 
Moreover, 

i,k <9.a,P,j,8 em. (i , a,P)S(k, 7 , 6 ) a, ft) < j l (k, 7 ,#), 

since J has this property, and since for this domain J coincides with j. 
Therefore 

itemise (9 x m 2 ,m). 

Now denote by j the function into which j is carried over by G, that is, j 
is defined by: 


33 * 1(9 x o 2 ) .j l (i,a',P') = \j l (i,a,P)J, 

for a,P e m and i < 9. This may be written j l (i,a,P) — [j‘(i,aii,Pi)]', 
for a,P e o, i < 9, where G l a is denoted by 07 . We want to show that 
j3(omsE (9 x o 2 ,o). Now: V(j) = 9 x o 2 and 2D (j) = o, because j has 
the corresponding properties. Since G is an isomorphism with respect 
to E, it follows by Definition 7.8 that (a,P)Le('y, 6 ). = . (a 1 , p^Le^', 6') 
for a, p, 7 , 6 e m. Likewise, by Definition 7.81, (a,P)R( 7 , 6 ). = . 
(a',P')R(')\ 6 ') for a,P, 7,6 e m. It follows then by Definition 9.2 that 

(*, a x> Pi)S(k, 71, «i> • = • ( i, a, P)S(k, 7, S) 

for a, P, 7 , 6 e o and i, k < 9. Now suppose a, P, 7 , S e o, i, k < 9 and 
(i,a,P)S(k,'y, 6 ). We have then (i,ai,Pi)S(k, 71 , 61 ), which implies, since 
, 7 'Dfom 5 s (9 x ro 2 ,m), that j l (i,ai,Pi)Ej l (k,ji,Si). Now, since G is an 
isomorphism with respect to E, we conclude that 

\j l (i, auP^j’Elj^k, 71 , < 5 i)]'> 

that is, j l (i,a,P)Ej‘(k, / y, 8 ). Therefore j CJforrise (9 x o 2 ,o). 

Now define j 0 = J ! (9 x o 2 ). Then D(j 0 ) = 9 x o 2 and 2D (j 0 ) is some 
ordinal number 7 , since 9 x o 2 is an 5-section of 9 x On 2 . Therefore under 
J the image must be an E-section of On, i.e., an ordinal by 7.21. Hence 
both j 0 3Jom SB (9 x o 2 , 7 ) and j Dfoms# (9 x o 2 , o); but there can exist but 
one isomorphism of this kind of a set on an ordinal number, by 7.62, hence 
7 = 0 and j 0 = j. Therefore 

jo (i, a', p') = f(i, a', 0) = [j l (i, a, P )]', 


for a,P e m, i < 9, which is equivalent, by the construction of j a and j, 
to the statement: J‘(i, a',/3') = [J 1 (i, a, P)]', for a,p e m, i < 9, which, 
in turn, is the same as: Jj (a 1 , P') = [J*‘ (a, P)]', for i = 0 ,..., 8 , a, P e m, 
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which is what we set out to prove. That o is closed with respect to the J, 
follows immediately from the last equality. 

12.4 can be stated symmetrically as follows: 

12.5 If m C On, m' C On, m, w! both closed with respect to Ki, K 2 

and the Jj as triadic relations and if | C (m, m'). then G 56 

is an isomorphism for the triadic relations J*. 

The proof is obtained by mapping m and vn! on the same ordinal o by 
7.7 and then applying 12.4. 

12.51 The hypothesis of 12.5 implies, furthermore, 

aeW{Ji).D.G‘aeW(Ji) for a e m, i = 0,... ,8. 

Proof: a e 2U(Jj) implies a — (3,jem, since m is closed with 

respect to Ki,K 2 . Hence a' = Ji(a',0) by 12.5; hence a' e 2D(Jf). 

Next it will be shown that: 

12.6 If m, m !, G satisfy the hypothesis of 12.5 and in addition m and 
m' are also closed with respect to C, then G is an isomorphism 
for the relations a/3(F l a e F l (3) and aj3(F l a = F‘/3). In other 
words, 

(a) a, (3 e m.D:F l a e F l /3. = .F l a' e F l /3'. 

F‘a = F73.ee.TV - F‘f3', 

where again G l a is abbreviated by a'. 

The scheme of the proof will be to carry out an induction on rj = 
9Haj:{a, f)}. We will assume as the hypothesis of the induction that (a) 
is true for a, /? e m and a,/3 <rj, and prove it for a,j3em, /?} = rj. 

(Hence the property which is shown by induction to belong to all ordinals 
r) is given by the propositional function: 

(a, 0)[a,pem-ri = 2ttaf{a, 0} : D: (F‘a eF l {3. = . F l G‘a e F‘G‘/3). 

(F‘a = F l p. = . F‘G‘a = F‘G‘/3)]. 

This expression is normal; therefore we can apply induction by 7.161.) If 
OTay {a, /?} = »? there are 3 possible cases, namely 


1) 


a = P = V- 
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In this case the equivalences (a) both hold, since, in the first, both members 
are false, and, in the second, both are true. The remaining two cases are 
a = 77, (3 < 77 and a < 77, f3 = 77. Hence what has to be proved is: 


F l a t F‘r). = .F‘a' e F l rf, 'j 

F‘ri e F l f3 . = .F l r}' e F‘/3 ', i for a,/3<rj, a,/3em, 

F l ri = F‘/3.=.F l ri' = F‘p', J 


under the hypothesis that rj e m and 


I. F'aeF‘p.1 
II. F‘a = F‘^. 


. F‘a' e F'f3', 1 

i.F‘a' = F i P', j 


for a,f3 e m ■ 77. 


Everything which follows from now on up to the end of the proof of 
Theorem 12.6 (in particular the theorems (l)-(9) on pages 57-9) depends 
on this inductive hypothesis in addition to the hypothesis of Theorem 12.6. 


57 The following abbreviations will be convenient: r = F“m, | r v = 
F“(m • r?), r' = F u m', and r'^ = F“(m' • r}'). Hence r v C r and C r’. 
Now we can define a one-to-one mapping H of r v on r' v by H = F|G|F _1 . 
Because of the inductive hypothesis II, H is one-to-one and H'x = F l a' 
if x = F l a, a e m ■ rj. Because of inductive hypothesis I, H is an iso¬ 
morphism with respect to E. Note that the hypotheses of Theorem 12.6 
and the inductive hypothesis are perfectly symmetric in m,m' and 77,77', so 
that whatever is proved from them will also hold if m, 77, r, r v , G, H are 
interchanged respectively with m', 77', r', r' v , G, H. 

The next step will be to show that H is an isomorphism for the triadic 
relation zxy[z = (x,y)] and the tetradic relation zuvw[z = (w, v, vj)], and 
for the Qi. In order to establish this some preliminary results are needed. 


(1) r is closed with respect to the fundamental operations. 

Proof: Take x,y er; then x = F l a, y = F l (3, a, (3 e m, so that &(*, y) e r, 
by 9.31-9.34, since m is closed with respect to the Jj. Therefore x — y, 
{x,y}, ( x,y ), { x,y,z ), and x ■ Qfy are in r if x,y,z e r. In particular, it 
follows that x ■ Qi{y} e r if x, y e r. 


(2) x e r .D .Od‘x e m 

Proof: {x} e r by (1), hence there is an a e m such that {a:} = F‘a. Set 
(3 = C'a. Then f3 e m, since rn is closed with respect to C and f3 = Od l x, 
by definition of C (11.81). 
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(3) (x e r). (x ^ 0):D.x • r ^ 0 

Proof: There is an a e m such that x = F l a. F‘C‘a e x, by Definition 
11.81; but F l C l a e r, since m is closed with respect to C; hence x ■ r ^ 0. 

(3.1) {x,y} er.D.x,yer; {x,y) e r .D.x,y e r; 

{x,y,z) er.D.x,y,zer. 

Proof: It follows from (3) that {x} e r. D .x e r, because x is the only 
element of {x}, also {x,y} e r . D .x,y e r, for, either x or y e r by (3); 
suppose x e r, then {x} e r by (1), hence {x,y} - {x} e r by (1), so that 
{y} e r if x ^ y, hence y e r. By iteration, (x, y) e r. D .x, y e r and 
(x,y,z) er .D.x,y,z e r. It follows then that 

(4) y e r .{y,x} t QiiD.x er for i ^ 5. 

Proof: Consider Qe, the permutation of the ordered pair: If (y, x) e Qe, 
then y = (u,v), (v,u) = x for some u, v. (u,v) e r by assumption, hence 
u, v e r by (3.1) so that (v,u) e r, by (1), | that is, x er. Similarly for the 58 
other permutations, i.e., Qt, Q$. Now consider Q 4 = P 2 -1 : assume y e r, 
{y,x) e Pf 1 ; then (x, y) e P2, i.e., y is an ordered pair and x its second 
member, hence x e r by (3.1). 

There is a weak completeness theorem for r v : 

(5) xer v .yex :D: ye r.D.yer v . 

Proof: Set a = Od‘y. Now aem, by (2); Od l y < Od l x < y, by 9.52; hence 
a e m- rj, that is, y e r v . 

(6) y e F l r).y er:D.y er v 

Proof: Od‘y < r) by 9.52. Od'y e m by (2); hence Od.‘y e m ■ r), i.e., y e r v . 

(7) {x, y} e r v . D. x, y e r v and (x, y) e r v . D. x, y e r v ; 

{x,y,z) err,.D.x,y,zer v . 

Proof: {x,y} e r, therefore x,y e r by (3.1); hence the result follows, by 
(5). By iteration it follows that (x, y) e r v . D .x, y e r n , and similarly for 
triples. 

(8) H is an isomorphism with respect to zxy[z = (x, y}], zxy[z ~ 

(x, y)], zxyt[z — (x, y, t)], and the Qi (i = 4, 5,..., 8). 


(In the sequel H‘x is abbreviated by x'. So the prime is an abbreviation 
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for G or H according as to whether it occurs with a Greek or a Latin letter.) 


Proof: Consider {x, y}; we wish to show that 

x, y, z e r„ . D: z = {x, y} . =. z' = {x', y'}. 

Recalling the symmetry of the hypotheses, and that x,y,z e r n is equivalent 
to x 1 , y', z' e r', it is obvious that it is sufficient to prove implication in one 
direction, in order to establish the equivalence. We prove implication from 
right to left; z' — {x',y'} implies x' e z' and y' e z', hence, since H is an 
isomorphism with respect to E, x e z and y e z, i.e., {x, y} C z. We have 
then only to show that z — {x,y} = 0. Since x,y,z e r, z — {x,y} e r by 
(1); hence by (3), if z — {x, y} ^ 0, there is a u e r such that u e [z — {x, y}]. 
So u e z, and z t r v , hence, by (5), u e r n . Then u e z,u ^ x and u / y, 
hence u' e z', v! ^ x' and u' / y\ because H is one-to-one and isomorphic 
for E. But this means z' ^ {x',y'}, contrary to assumption. 

To establish that H is an isomorphism for z = (x,y), it must be shown 
that 

x,y,z e r v .D:z = (x,y). = .z' = {x',y'). 

Again it is sufficient to establish implication in one direction. Assume 
z = ( x,y ). It follows that z = {u, w}, where u = {x,a;} and v — {x,y}. By 
59 (7), u,v e r v ; hence, forming z', u\ v', x\ y', it follows | that v' — {x', y’}, 

u' = {x',x'}, and z' = {u',n'}, that is, z' = { x',y'). 

For the ordered triple, assume z = { x,y,t ); then z = ( x,s ), where 
s = (y, t); t,s e r v , by (7), since z e r v ; therefore z' = (x',s'), s' = (y',t'), 
that is, z' = ( x',y',t'). 

Consider now Q5 = P2; we must show that 


x,yer v .D:(x, z) e P 2 . = . (x',z') e P 2 . 

As usual, only the implication in one direction is necessary. Assume 
(x,z) e P 2 ; then there is a y such that z = {y, x); by (7) y e r v , there¬ 
fore z' = {y',x') by (8), that is, (x',z r ) e P 2 . Now since H is an isomor¬ 
phism with respect to P 2 , H must be an isomorphism also with respect to 
Q4 = P 2 1 . 

There remain only the permutations Q e , Q7, Qg- Consider Q<>- for 
example. Assume ( x , y) e Q 6 ; then there exist u and v such that x = (u , v) 
and y = (v, u). Since x,y e r r/ , it follows by (7) that u,v e r n , hence 
x' = (■«', v 1 ) and y' = (?/, u') by (8), that is, {x',y') e Qq. The proofs are 
similar for Q 7 and Qg. 
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Now consider the three relations which must be proved to establish the 
induction, namely, 

(9) 1. F‘a e F‘t/. = .F‘c/ e F‘r/> 1 

2 . F‘r/eF‘/3. = .FV eF l 0', > for a,/3em r]. 

3. F l/ q = F‘/3 . =. F l rf' = F‘/3', J 

We shall show now that it is sufficient to prove the first of these three 
relations. Let us assume then that the first is true, and prove the third. 
Assume that F l r] ^ F‘/L Then either F l rj — F‘/3 ^ 0 or F‘/3 — F l r] ± 0, 
and [F l r] - F l 0 ] e r, [F‘/? - F l tj] er by (1). Hence by (1) and (3) there is a 
u e r such that either u e [F l r) — F‘0] or u e [F‘/3 — F l rj]. Therefore u e F l r] 
or u e F l f3, hence in both cases u e r v , by (6) and (5), since F‘/3 e r v . Let 
us now assume u e [F l rf — F‘/3]; then u e F‘r] and ~(tt e F‘f3). Hence by the 
inductive hypothesis I, we have ~(u' e F‘/3'); but also v! e F l rj, because we 
have assumed (9) 1 to be true. Therefore F l r)’ — F l /3' ± 0. Suppose, on the 
other hand, that u e [F‘f3 - F‘r/]; then u e F L f3 and ~(u e F‘r?). Exactly as 
above, we have u' e F‘(3' and ~(u' e F‘r)'), that is, F l rf F l [3'. Thus we 
have shown 

and the inverse follows by symmetry as usual. 

We have now established that the third relation of (9) follows from the 
first. Now we derive the second from the first and third. Assume that 
F‘r) e F‘/?; set a = Od‘F l r). By 9.52 a < /3 < rj and by (2) a e m ■ r). 
So F l r] = F‘a and therefore F‘a e F l /3; from F‘ry = F‘a it follows by (9) 
3 that F'rf = F‘a'; moreover F‘a' e F‘/3', by the inductive hypothesis 
I, hence F‘?/ e F‘/3', that is, F‘»j e F‘/3. D .F l rf e F‘/3' and the inverse 
implication by reasons of symmetry. Therefore it is sufficient to show (9) 
1 and by symmetry it is sufficient to show: 

| F‘a e F l rj. D. F l a' e F l rf for a tm-r). 

So we assume F‘a e F l ri, and consider separate cases according to the 
index i such that r] eW(Ji). 

1. Suppose r] e W(Jo); by 12.51, v[ e W(Jq), hence F‘r/ = F“r/ and 
F l t]' = F‘V, by 9.35, so that both members of the equivalence (9) 1 are 
true, hence trivially equivalent. 

2. Suppose r) e 2D (./i). Then r) — J\ (fj, 7 }, where /3 ,7 c m (by the 

closure property of m) and fi : 7 < r/, by 9.25. Also r/' = J{ by 12.5, 

so that 9.31 gives: F l r) = {F‘/3,F‘ 7 } and F‘?/ = (F‘/3',F‘ 7 '}. Suppose 
F‘q e F‘t); then F‘a -- F‘/3 or F‘a = F‘ 7 ; therefore, by the inductive 
hypothesis II, F‘a' = F‘/?' or F‘a' = F‘ 7 ', that is, 
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F‘a'e{F‘/3',F‘7'}; 


in other words, F‘a' e F‘77'. 

3. If 77 e 2D(J 2 ), then we have, as before, 77 = J 2 (/3, 7) and 77' = 
J 2 (/3',Y>, A 7 By 9.32, F‘77 = E ■ F‘0 and F‘r)' = E ■ F‘(3'. If 

F‘a e F‘y, then F‘a e F l /3 and F‘a e E. It follows that F l a' e F‘/3', by the 
hypothesis I of the induction. From F‘a e E it follows that F‘a = (x, y) 
and x e y for some x,y; F‘a e r T( , hence x,y e r v , by (7), therefore F’a' — 
( x ', y'), by (8), and x' e y', that is, F‘a' e E. Hence 

F‘a' e E • F‘/3'; 


in other words, F l a' e F l y'. 

4. If 77 e 2D (J 3 ), we get in the same fashion, by 9.33, F l r) — F‘(3 - 
F‘7 and F‘77' = F l (3' — F‘7', /?,7 e 771-77. Assume F‘a e F l r 7, and the 
inductive hypothesis I applied to F‘a, with F‘/3 and F‘7, gives F‘a' e F‘77' 
immediately. 

5. Suppose 77 e 2U(J*), i = 4,6,7,8. As above, 77 = </*{/?, 7), 77' — 

/?,7 e 777-77, so P‘»7 - F‘/3 • Qj‘F‘7 and F‘77' = 

F‘/?' • Q“F l 7', by 9.34. Now assume F‘a e F‘77, that is, F l a e F‘/3 and 
F‘a e QiF ‘7. It follows that F l a' e F‘/3'; also by Definition 4.52 there is 
anre F‘ 7 such that (F‘a,x) e Qj. Now x e r by (4) and x e F‘ 7 e r v , 
hence x e r v , by (5), so that, by (8), {F l a' ,x') e Qf, in addition x' e F‘7', 
hence 

F‘a' e Qi“F i y ; 

hence F‘a' e F‘77'. 

6. There remains now only the case 77 e 2D(J5). As before, 77 = 
J5 {/?, 7} and rf = J5 (/?',7'), that is, F‘77 — F‘/3 • P 2 (F‘7) and F‘77' = 
F‘/3'-P 2 (F‘7'). Note that x e P 2 y is equivalent to y • P 2 {x} ^ 0. Suppose 
F‘a e F‘?7; then F‘a e F‘/3, and F‘a e P 2 F‘7, that is, F‘7 • P 2 {F‘a} ± 0. 
F‘a e r and F‘7 e r; hence by (1) [F‘7 • P 2 {F‘a}] e r, therefore by (3) 
there is a u e r such that u e F‘7. u e P 2 {F‘a}. Then by (5) u e r n \ since 
u e F‘7 and (u, F‘a) e P 2 , it follows that v! e F‘7' and {u',F‘a') e P 2 by 
(8), that is, 

F‘a' e P 2 “(F‘7'); 

therefore, since F‘a' e F‘/3', it follows that F‘a' e F‘77'. This concludes the 
proof of 12.6. 

61 | 12.3 follows immediately from 12.6, since if to, o satisfy the hypothesis 

of 12.3, o must be closed with respect to Jj by 12.4 and with respect to C, 
K 1, K‘2 (because a < a, Ki a < a by 9.25 and C‘a < a by definition). 
Hence to, o satisfy the hypothesis of 12.6. 
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But on page 54 it was shown that 12.2 follows from 12.3. So it is proved 
that the generalized continuum hypothesis is a consequence of X) and the 
additional axiom V — L, q.e.d. 18 22 


18 [Note added in 1951: The above given consistency proof can easily be extended for 
the case that stronger axioms of infinity are added (e.g., the axiom of the existence of 
unaccessible numbers, or others given by P. Mahlo (1911, 1913)), for the simple reason 
that all these axioms of infinity imply their own relativized form. A similar remark also 
applies to extensions of the system S by other axioms suggested by the intuitive meaning 
of the primitive terms. Note added in 1966: This holds for the axioms of infinity and 
other additional axioms known at that time (1951).] 

19 [Note added in August 1965 : In the past few years decisive progress in the founda¬ 
tions of set theory has been achieved by Paul J. Cohen, who invented a powerful method 
for constructing denumerable models. This method yields answers to several most im¬ 
portant consistency questions. In particular Paul J. Cohen (1963, 1964, 1966) has 
proved that Cantor’s continuum hypothesis is unprovable from the axioms of set theory 
(including Mahlo or Levy type axioms of infinity), provided these axioms are consistent. 
The value that can consistently be assigned to 2 N “ turns out to be almost completely 
arbitrary. See Cohen 1963, 1964, 1966, Solovay 1963 and Easton 1964, 1964a.] 

20 [Note added in August 1965: Other quite important progress has been made in 
the area of axioms of infinity, namely: 

1. Mahlo’s axioms of infinity have been derived from a general principle regarding the 
totality of sets, that was first introduced by A. Levy (I960). It gives rise to a hierarchy 
of different precise formulations. One, given by P. Bernays, implies all of Mahlo’s axioms 
(see Bernays 1961). 

2. Propositions which, if true, are extremely strong axioms of infinity of an entirely 
new kind have been formulated and investigated as to their consequences and mutual 
implications in Tarski 1962, Keisler and Tarski 1964 and the papers cited there. In 
contradistinction to Mahlo’s axioms the truth (or consistency) of these axioms does not 
immediately follow from “the basic intuitions underlying abstract set theory” (Tarski 
1962, page 134), nor can it, as of now, be derived from them. However, the new axioms 
are supported by rather strong arguments from analogy, such as the fact that they are 
implied by the existence of generalizations of Stone’s representation theorem to Boolean 
algebras with operations on infinitely many elements. As was conjectured in a general 
way in Godel 1947, page 520, one of the new axioms implies the existence of non- 
constructible sets (see Scott 1961). Whether one of them implies the negation of the 
generalised continuum hypothesis has not yet been determined.] 

21 [Note added in August 1965: A general discussion of Cantor’s continuum problem 
and its relationship to the foundations of set theory is given in Godel 194 7 and 1964-] 

22 [Note added in August 1965: A slightly different version of the consistency proof 
given in these lectures, which exhibits more clearly the basic idea of it, is outlined in 
Godel 1939a] 
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62 | Appendix 

The following list is a continuation of the list of page 13 and shows 
by the method explained there that all notions and operations for which 
special symbols are introduced in these lectures (except only ~ and 2Ue) 
are normal. 23 


4.1 Ze X xY. = .(3u,»)[(«,«) =Z.ue X.veY] 

4.11 Z e X 2 . =. Z e X x X (similarly for X 3 ) 

4.2 mtl(X). = .XCV 2 (similarly for SHd 3 ) 

4.4, 4.41, 4.411 Z e Cno (X). =. (3u,«)[(«, v) = Z. ( v , u) e X] 
(similarly for <£n0 2 , Cno 3 ) 

4.42 ZeX + Y. = :ZeX.W.ZeY 

4.43 ZeX — Y. = :ZeX. ~(Z e Y) 

4.45 Ze<B(X). = .ZeV(£m(X)) 

4.5 ZeXfY. = .ZeX-(VxY) (similarly for 1 (4.512)) 

4.52 Ze X“Y. = .ZeW(XtY) 

4.53 ZeX\Y. = . (3 u,v, w)[Z = (u, w) ,(u,v)eX. (v , w) e Y] 

4.6 itrt2 (X) . =: Hn(X) .iln(<£nt>(X)) 

4.61 5nc(X). = :fKel(X).iln(X) 

4.63 X Y. =: Jnc (A). 2) (A) = Y 

4.65 Z e X‘Y . = . (3 u)[Z e u . (»)((», Y) e A. =. v = it)] 

4.8 Z e & (A). =. (Bu)[Z eu.ue X] (the same proposition holds for 
aiTaj: and Hint) 

4.84 Z e<P(X). = :m(Z).Z C X 

6.1 XConY.EE.Y 2 C X + £m(X) + I 

6.3 A6ect z Y . = : Y ■ Z“X C A.ACY 
6.31 Z e &eg T (X, Y) . = . Z e X ■ T“{Y} 

6.4 Z 3fom Pi Q(A, Y). =. 9W (Z ). iltia (Z ). S (Z) = X. 

W(Z)=Y. (u,»)[«, v e X . D. ((«, v) e P . =. (Z‘u, Z l v) e Q)] 

6.5 Comp (A). = .& (A) C A 

6.6 Drh (A). = : Comp (A). E Con A 

6.61 £)(A). = :Drh(A).9«(A) 

a, 0, 7, • • • are normal variables since their range is the class On. 
6.63,6.64 A < Y . = . X e Y; A <Y. = :AcY.V.A = Y 
7.4 ZeX + l. = :ZeX.V.[Z = X.Sm(Z)] 

8.12 A Y. =:. (3ti)pW(«) .lln 2 («). A = ® (u). Y = 2&(u)] 

8.2 Z eX .= : Z eOn. (a)[a A . D. Z e a] 




Continuum, hypothesis 


99 


8.48,8.49 3 m(X). = .(3a)[aew.X~'a], 3nf (X) . = . ~3tn(X) 

9.1 Ze^(X,Y). = .Ze{X,Y}, Z e$ 2 (X,Y) =. Z e E ■ X 

(similarly for ga, • • •, gs) 

9.41 £(X). = :X CL.(u)[ueLDu-X eL] 


Index 


I. Special Symbols 


{x,y}, (1.1), 3 
M, (1-11), 3 
{X,Y}, (3.1), 11 
(x), (1.17), 4 
( x,y ), (1.12), 4 
(x,y,z), (1.14), 4 

<*!,..., x„), (1-15), 4 
(X, Y), (3.12), 11 

0 , ( 2 . 1 ), 8 
XCY, (1.2), 4 
X C Y, (1.2), 4 
X ■ Y, (1.4), 5 
-X, (1.41), 5 
X x Y, (4.1), 14 
X 2 , (4.11), 14 
X 3 , (4.12), 14 
X" 1 , (4.412), 15 


X, (4.412), 15 
X + Y, (4.42), 15 
X-Y, (4.43), 15 
uXv, (4.211), 14 
X T Y, (4.5), 15 
Y1X, (4.512), 16 
X“Y, (4.52), 16 
X | Y, (4.53), 16 
X‘Y, (4.65), 16 
X < Y, (6.63), 23 
X < Y, (6.64), 23 
X + 1, (7.4), 26 
1, 2, 3, etc., 
(7.44,7.45), 26 
X ~ Y, (8.1), 30 
X Y, (8.12), 30 
£i,...,£„[ ], 15 

T, (8.2), 30 


* (at the number of a theorem or definition), 7 
(x), (3x), (£!x), V, ., D, =, ~, =, 2 


II. Letters and Combinations of Letters 

(Note that the letters C, F, R, S also occur as variables before their 
respective definitions as constants. Operations and notions are denoted in 
general by German letters, classes by Latin letters.) 
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As, (11.8), 53 
Aeq, (8.13), 30 
C, (11.81), 53 

eta, 3 

Cra>, (4.4), 15 
CnOi (* = 1,2,3), 

(4.41-4.412), 15 
Comp, (6.5), 23 
Con, (6.1), 21 
Do, (5.17), 19 
£>, (1.5), 5 

E, (4.3), 15 
Cm, (1.22), 4 
<&, (1-23), 4 
gin, (8.48), 32 

F, (9.3), 37 

(9-1), 35 
(?nc, (4.61), 16 
tfn, (4.63), 16 

l, (4.31), 15 
3nf, (8.49), 32 
Jfom, (6.4), 22 
J, (9.21), 36 
Ji (* = 0,..., 8), 

(9.22), 36 
Ki,K2 , (9.24), 36 
KuKu, (7.42,7.43), 26 
L, (9.4), 38 
£, (9.41), 38 
l (subscript), 42 
Le, (7.8), 28 
£im, (7.31), 25 

m, 3 

2%, (7.31), 25 
M1-M6, 8, 13, 14, 15, 16, 16 


N, (8.21), 30 
N', (8.54), 33 
Nc, (8.20), 30 

O, (6.61), 23 
Od, (9.421), 38 
On, (6.62), 23 
Dtf), (6.6), 23 

P, (7.9), 29 
qi, (4.84), 17 
ppf, 8 

yt, (i), 3 
Pi(i = 1,...,5), 
(4.71-4.75), 17 
Qi (* = 4,...,8), 
(9.14), 36 

R, (7.81), 28 
SKel, (4.2), 14 
Mein, (4.21), 14 
6, (4.8), 17 

S, (9.2), 36 
©eg, (6.31), 22 
Sect, (6.3), 21 
iln, (1.3), 5 
ilna, (4.6), 16 
V, (2.2), 8 
2», (4.44), 15 
2De, (6.2), 21 
K, (8.57), 33 
« a , (8.59), 33 
u) a , (8.59), 33 
u, (8.4), 31 

e, 3 
E, 7 
A, 41 


Variables: 


X,Y,Z,..., A, B,C,..., for classes 
x,y,z,..., a,b,c,..., for sets 
a, (3, 7,..., for ordinal numbers 
for integers 

X, Y,..., A, B,..., for constructible classes 
x,y,..., a, b, ..., for constructible sets 
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III. Technical Terms 


absolute (notion, operation, 
class, variable), 42 
asymmetric, (6.12), 21 
cardinal, (8.2), 30 
cardinal number, (8.20, 8.21), 30 
class, 3 

class, particular, 11 
class, proper, (1), 3 
closed, (8.7, 8.71), 34 
closure, (8.72), 34 
complement, (1.41), 5 
complete, (6.5), 23 
concept, 12 

confined to, (4.5, 4.512), 15-16 
connex, (6.1), 21 
constructible, (9.4, 9.41), 38 
converse, (4.4, 4.41, 4.411), 15 
designated element, (11.8), 53 
domain, (1.5), 5 
domain of values, (4.44), 15 
empty, (1.22), 4 
equivalent, (8.1, 8.12), 30 
exclusive (mutually), (1.23), 4 
finite, (8.48), 32 
function, (4.61), 16 
function over, (4.63), 16 
fundamental operation, (9.1), 35 
image (by), 14 
infinite, (8.49), 32 
integer, (8.4), 31 
intersection, (1.4), 5 
isomorphism (isomorphic), 

(6.4, 6.41), 22 
limit, (7.31), 25 
maximum, (7.31), 25 
minimal formula (or proposi¬ 
tional function), 12 
monotonic (strictly), (7.61), 27 
normal (notion, operation, 
variable, term, propositional 
function), 12 
notion, 11 


n-tuple, (1-15), 4 
|null class, (2.1), 8 
one-many, (1.3), 4, 5 
one-to-one, (4.6), 16 
operation, 11 
order, (9.421), 38 
ordinal, (6.6), 23, cf. also page 22 
ordinal number, (6.61), 23 
ordinal function, (7.6), 27 
ordinal number of first and 
second kind, (7.42, 7.43), 26 
original, 14 

pair, non-ordered, (1.1), 3 

pair, ordered, (1.12), 4 

particular class, 11 

postulate, defining, 12 

power set (power class), (4.84), 17 

primitive propositional function, 

8 

product, outer (or direct), (4.1), 
14 

proper class, (1), 3 
propositional function, 12 
range (of a variable), 12 
relation, (4.2), 14 
relation, n-adic, (4.21), 14 
relativization (of notions, 

operations, particular classes, 
variables), 42 
set, 3 

segment, (6.31), 22 
section (proper section), 

(6.3, 6.30), 21 
single-valued, (1.3), 4, 5 
sum, (4.42, 4.8), 15, 17 
term, 12 

transitive, (6.11), 21 
triple, ordered, (1.14), 4 
universal class, (2.2), 8 
value, 14 

variable (kind of), 11 
well-ordered by, (6.2), 21 





Introductory note to 1944 


This paper was written for The philosophy of Bertrand Russell , a 
volume of Paul Arthur Schilpp’s series the Library of living philosophers. 
In his letter of invitation of 18 November 1942, Schilpp proposed the title 
of the paper and also wrote: “In talking the matter over last night with 
Lord Russell in person, I learned that he too would not only very greatly 
appreciate your participation in this project, but that he considers you 
the scholar par excellence in this field.” Godel sent in the manuscript on 
17 May 1943. There followed a lengthy correspondence about stylistic 
editing proposed by Schilpp and Godel’s own deliberation concerning 
revision. Before Godel had submitted the final version, Russell had 
completed his reply to the other papers and decided that under the 
circumstances he would not reply to Godel’s. When Godel finally sent 
in the revised version on 28 September, he wrote to Russell attempting 
to change his mind about not replying. He undertook to disabuse Russell 
of his impression that what Godel said would not be controversial, and 
emphasized his criticisms of Russell. However, Russell confined himself 
to the following brief note: 

Dr. Godel’s most interesting paper on my mathematical logic came 
into my hands after my replies had been completed, and at a time 
when I had no leisure to work on it. As it is now about eighteen 
years since I last worked on mathematical logic, it would have taken 
me a long time to form a critical estimate of Dr. Godel’s opinions. 

His great ability, as shown in his previous work, makes me think it 
highly probable that many of his criticisms of me are justified. The 
writing of Principia Mathematica was completed thirty-three years 
ago, and obviously, in view of subsequent advances in the subject, 
it needs amending in various ways. If I had the leisure, I should 
be glad to attempt a revision of its introductory portions, but 
external circumstances make this impossible. I must therefore ask 
the reader to give Dr. Godel’s work the attention that it deserves, 
and to form his own critical judgment on it. ( Schilpp 1944 > page 
741) 

Godel subsequently contributed 1949a to the volume on Einstein in 
Schilpp’s series and also accepted an invitation to contribute to the 
Carnap volume ( Schilpp 1963). Several drafts of this paper on Carnap 
survive in Godel’s Nachlass, but it was never actually submitted. Still 
later, Godel declined an invitation to contribute to the Popper volume. 

The paper 1944 was reprinted twice {1964a, 1972b), with only edito¬ 
rial changes in the text. In an opening footnote added in 1964a, Godel 
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clarifies the difference between the use of the term “constructivistic” in 
that paper and the more usual uses; this remark is revised and expanded 
in 1972b. 

The paper is notable as Godel’s first and most extended philosophi¬ 
cal statement. It holds mainly to the form of a commentary on Russell; 
as such it has been quite influential. However, Godel is not reticent in 
expressing his own views. In the present note I shall give more emphasis 
to what it reveals about the thought of its author. 

The organization of the paper is difficult for the present commenta¬ 
tor to analyze. He cannot but endorse Hermann Weyl’s remark that 
the paper “is the work of a pointillist: a delicate pattern of partly dis¬ 
connected, partly interrelated, critical remarks and suggestions” {1946, 
page 210). Nonetheless, Godel’s paper might be divided as follows: 

1. Introductory remarks (125-128). 

2. Russell’s theory of descriptions (128-131). 

3. The paradoxes and the vicious-circle principle (131-137). 

4. Godel’s own realistic view of classes and “concepts” (137-141). 

5. Contrast with Russell’s “no-classes theory” and the ramified theory 
of types; limitations of the latter (141-147). 

6. The simple theory of types (147-150). 

7. The analyticity of the axioms of Principia (150-152). 

8. Concluding remarks on mathematical logic and Leibniz’ project 
of a universal characteristic (152-153). 

I follow this division in the remainder of this note, where I use a number 
in parentheses without a date to indicate a page number in Godel 1944- 

The Godel archive contains reprints of 1944 (designated below as A- 
D) and a loose page (designated E) containing annotations to it. All 
changes on A-E are listed in the textual notes for 1944 a t the end of 
this volume. Many of the annotations are textual emendations, for the 
most part either of a stylistic nature or for greater explicitness. A few 
indicate changes of view on specific points. There is no way of knowing 
whether any of these changes represents a final position for Godel. They 
are hardly reflected at all in the reprints 1964a and 1972b. E contains 
some remarks on Bernays’ review 1946 of the paper. I will comment 
below on only a few of the annotations. 


1. Introductory remarks 

Early on, Godel remarks on Russell’s “pronouncedly realistic atti¬ 
tude” and the analogy with natural science expressed by Russell’s re¬ 
mark, “Logic is concerned with the real world just as truly as zoology, 



104 


Note to 1944 


though with its more abstract and general features” (1919, page 169). 
He also mentions another, epistemological analogy in Russell’s view that 
the axioms of logic and mathematics do not have to be evident in them¬ 
selves but can obtain justification from the fact that their consequences 
agree with what has been found evident in the course of the history of 
mathematics (127). Godel remarks: “This view has been largely justi¬ 
fied by subsequent development, and it is to be expected that it will be 
still more so in the future.” 

The essay as a whole might be seen as a defense of these attitudes of 
Russell against the reductionism prominent in his philosophy and im¬ 
plicit in much of his actual logical work. It was perhaps the most robust 
defense of realism about mathematics and its objects since the para¬ 
doxes had come to the consciousness of the mathematical world after 
1900. Bernays’ earlier defense of realism (for example, in 1935 ) was 
more cautious. Godel begins to develop this theme when he turns to 
Russell’s approach to the paradoxes (see especially §§3-5 below). 


2. The theory of descriptions 

This discussion is noteworthy. Godel indicates (128-129 and note 5) 
a formal argument for Frege’s thesis that the signification (his transla¬ 
tion of Frege’s Bedeutung) of two sentences is the same if they agree in 
truth value. The argument collapses intensional distinctions on the ba¬ 
sis of simple assumptions about signification. A similar argument to the 
same conclusion, from somewhat different assumptions, occurs in Church 
1943, pages 299-300. Such collapsing arguments have been prominent 
since in philosophical discussions of meaning and reference, modality 
and propositional attitudes. Godel concedes that Russell’s theory of de¬ 
scriptions avoids Frege’s conclusion and allows a sentence to signify a 
fact or a proposition. 8. He expresses the suspicion that it only evades the 
problem (130). 


a Church (1942) observes that Russell’s theory of descriptions eliminates apparent 
violations of the substitutivity of identity in intensional contexts. The universal 
substitutivity of identity is one of the assumptions on which Godel’s and Church’s 
collapsing arguments turn. Church’s observation is the basis for the reply to Quine’s 
criticism of modal logic in Smullyan 1948. Quine’s use of the argument in criticizing 
modal logic is its most widely known and influential use. However, it does not 
occur in either of his two early papers on this theme (1943 and 194 7). The earliest 
occurrence of a collapsing argument of the Godel-Church type that I have been able 
to find in Quine’s writings is 1953, p. 159; see also 1953a {1916, pp. 163-164). Both 
these arguments are essentially the same as Church’s. 
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3. The paradoxes and the vicious-circle principle 


Godel says that Russell “freed them [the paradoxes] from all math¬ 
ematical technicalities, thus bringing to light the amazing fact that our 
logical intuitions (i.e., intuitions concerning such notions as: truth, con¬ 
cept, being, class, etc.) are self-contradictory” (131). Many readers 
have been puzzled by the contrast between this statement and the de¬ 
fense of the concept of set in 1947, where he says that the set-theoretical 
paradoxes “are a very serious problem, but not for Cantor’s set theory” 
(page 518), revised in 1964 to “• • • problem, not for mathematics, how¬ 
ever, but rather for logic and epistemology” (page 262). b Closer exam¬ 
ination makes the passages not difficult to reconcile. In the later paper 
Godel says that the concept of set in contemporary mathematics, includ¬ 
ing Cantor’s set theory, can be taken to be what we call the iterative 
conception of set, according to which sets are obtained by iterated ap¬ 
plication of the formation of sets of previously given objects, beginning 
with some well-defined objects such as the integers. It is this conception 
that has “never led to any antinomy whatsoever” and whose “perfectly 
‘naive’ and uncritical working” has “so far proved completely self-consis¬ 
tent” (1964, page 263). 

In the present paper, Godel is following Russell in being concerned 
with the foundations of logic in a larger sense; note that in the above 
quotation our logical intuitions are said to be “concerning such notions 
as: truth, concept, being, class, etc.” Though it is mentioned (144), the 
iterative conception of set is kept in the background, perhaps more so 
than was optimal for the purpose of defending realism. But, as we shall 
see, Godel’s purpose was not limited to defending realism about sets or 
the objects of classical mathematics. 

After short remarks about two proposals that Russell discussed briefly 
in 1906 , the “theory of limitation of size” and the “zig-zag theory”, 
Godel turns to the vicious-circle principle. The masterly analysis of 
the ambiguities of this principle and the criticism of the principle itself 
constitute probably the most influential piece of direct commentary on 
Russell in the essay and also supply a major argument for Godel’s own 
position. Godel’s main criticism, which had already been intimated by 
Ramsey in 1926 (1931, page 41), is that the strongest form of the prin¬ 
ciple, that no totality can contain members definable only in terms of 
this totality, is true only if the entities whose totality is in question are 
“constructed by ourselves”. Godel adds: “If, however, it is a question of 


b Perhaps Godel thought it necessary to clarify the difference of his concerns from 

those of 1944■ 
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objects that exist independently of our constructions, there is nothing in 
the least absurd in the existence of totalities containing members, which 
can be described ... only by reference to the totality” (136). Earlier 
Godel observed that this form of the principle is not satisfied by classi¬ 
cal mathematics or even by the system of Principia mathernatica with 
the axiom of reducibility. He considered this “rather as a proof that the 
vicious-circle principle is false than that classical mathematics is false” 
(135). 

These remarks lead Godel into the declaration of his realistic point 
of view. But before discussing this we should note his characteriza¬ 
tion of the position that would justify the strong vicious-circle principle, 
which he calls constructivistic or nominalistic. He seems to regard this 
viewpoint as involving the eliminability of reference to such objects as 
classes and propositions (136-137). His model is clearly Russell’s no¬ 
class theory. 


4. Godel’s realism 

Classes and concepts, according to Godel, may be understood as real 
objects “existing independently of our definitions and constructions” 
(137). It should be stressed, as it has not been in previous commen¬ 
taries on this paper, that Godel’s realism extends not only to sets as 
described in axiomatic set theory, but also to what he calls concepts: 
“the properties and relations of things existing independently of our 
definitions and constructions”. He is clearly referring to both in his 
often-quoted remark that “the assumption of such objects is quite as 
legitimate as the assumption of physical bodies” and that “they are in 
the same sense necessary to obtain a satisfactory system of mathematics 
as physical bodies are necessary for a satisfactory theory of our sense 
perceptions” (137). These remarks have excited critical comment. 0 To 
deal in an adequate way with the questions they raise would be quite 
beyond the scope of a note of this kind. However, we should state some 
of these questions. 

(i) What does Godel mean by “real objects” and “existing inde¬ 
pendently of our definitions and constructions”? A question of this 
kind arises about any form of philosophical realism. If realism about 
sets and concepts is to go beyond what would be asserted by a non- 


c For example, Chihara 1973, pp. 61, 75-81; Chihara 1982, part I; Dummett 1978, 
p. 204. 
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subjectivist form of constructivism, this reality and independence will 
have to amount to more than objective existence. In the present paper, 
Godel does not undertake very directly to clarify his meaning, though 
something can be learned from his discussion of the ramified theory of 
types (see below) . d Of course, the general question “What is realism?” 
has been much debated in quite recent times, largely through the stim¬ 
ulation of the writings of Michael Dummett. e 

A point that needs to be stressed, however, is that Godel saw his real¬ 
ism in the context of concrete problems and as motivating mathematical 
research programs. This is perhaps most evident in 1947/1964, with its 
defense of the view that the continuum hypothesis is definitely true or 
false even though probably (and by 1963 certainly) independent of the 
established axioms of set theory. Further reflection shows that it is very 
much present in the paper under discussion, where Godel criticizes ideas 
of Russell that obstructed the transfinite extension of the hierarchies of 
simple and ramified type theories. The remarks about his own theory of 
constructible sets (146-147; see below) are an illustration of the “cash 
value” of realism for Godel. f 

(ii) How does Godel understand the parallel between the objects of 
mathematics and “physical bodies”? It would be tempting to suppose 
that Godel views sets and concepts as postulated in a theory to explain 
certain data. 8 This is suggested by the parallel itself between the ne¬ 
cessity of sets and concepts for a “satisfactory system of mathematics” 
and the necessity of physical bodies for a “satisfactory theory of our 
sense perceptions”; it is also in line with Godel’s approval, noted above, 
of Russell’s suggestion that mathematical axioms can be justified by 


d In 1949a Godel argues that the general theory of relativity calls in question the 
objectivity of time and change. He sees this as a confirmation of idealistic views, in 
particular Kant’s. He does not attempt to draw any parallel with a possible anti¬ 
realist view of mathematics. 

Of course, Kant did not “deny the objectivity of change” (Godel 1949a, p. 557) 
if what is meant by the latter is the existence of an objective temporal order that is 
the same for all observers with our forms of intuition. Evidently Godel thinks of the 
dependence of the temporal ordering of events on the position and state of motion of 
the observer according to relativity theory as parallel to the dependence of the very 
temporality of the experienced world on the constitution of our cognitive faculties 
according to Kant. 

e For example, the essays in Dummett 1978, especially “Truth”, “Realism”, and 
“The reality of the past”. 

f Cf. note 48a of Godel 1931, which Burton Dreben called to my attention. 

s This interpretation is assumed by Dummett (1978, p. 204) and in some of Chi- 
hara’s criticisms (for example, 1982, pp. 214-215). 
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their consequences. But more direct evidence for this interpretation is 
lacking. 11 

Godel does, however, use the notion of “data” with reference to math¬ 
ematics. Indeed, one point of parallelism between mathematics and 
physics is simply that “in both cases it is impossible to interpret the 
propositions one wants to assert about these entities as propositions 
about the ‘data’” (137). But at this point he refrains from saying what 
plays the role of data in the case of mathematics. We shall return to 
this matter in connection with his discussion of the ramified theory. 

(iii) By “concepts” Godel evidently means objects signified in some 
way by predicates. The notion “property of set”, which he counts among 
the primitives of set theory (1947, note 17, or 1964, note 18), is clearly 
a special case of this notion. Why he should have considered “property 
of set” a primitive of set theory is clear enough from the role of classes 
in set theory and from the generalization with respect to predicates con¬ 
tained in the axioms of separation and replacement. Godel therefore did 
not lack mathematical motivation for adding something like concepts to 
his ontology. 1 

But what sort of theory of concepts did Godel envisage? What con¬ 
sequences for the theory does realism about concepts have, once the 
existence of sets as “real objects” is granted? The use of the notion of 
class that is standard in set theory is predicative relative to the universe 
of sets. In the above-cited note, he seems to envisage impredicative the¬ 
ories of properties based on the simple theory of types, which he also 
mentions as a theory of concepts in the present paper (140). In the note 
he remarks that such theories are not deductively stronger than exten¬ 
sions of the axioms referring to sets. 

It is clear that Godel takes his realism about concepts to justify an 
impredicative theory, and he suggests strongly that he would prefer a 
stronger theory than the simple theory of types. He claims (139) that 
impredicative specifications of properties do not themselves lead to ab¬ 
surdity and that a property might “involve” a totality of properties to 


h Godel does not use the language of explanation in the two passages where he 
is most explicit about the justification of mathematical axioms through their conse¬ 
quences ( 1964 , PP- 265, 272), although in the former he does describe a (hypothetical) 
situation in which axioms “would have to be accepted in at least the same sense as 
any well-established physical theory.” In spite of its lack of direct support, the inter¬ 
pretation in terms of explanation is difficult to refute. 

In these passages in 1964 , Godel seems to me to be considerably more cautious 
about justification of axioms by their consequences than he appears to be in the 
above-noted passage (127). 

‘This view may be reflected in Godel’s choice of a theory with class variables as 
the framework for 1940 . Note the remark on page 2, “Classes are what appear in 
Zermelo’s formulation ... as ‘definite Eigenschaften’.” 
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which it belongs, thus contradicting the second of the three forms of the 
vicious-circle principle he has earlier distinguished.' He also remarks: 
“Nor is it self-contradictory that a proper part should be identical (not 
merely equal) to the whole, as is seen in the case of structures in the 
abstract sense” (139). Of course, on the set-theoretic conception of 
structure, such identity (as opposed to isomorphism) does not obtain 
and would indeed be self-contradictory, at least if one takes “proper 
part” in its obvious meaning of a substructure whose domain is a proper 
subset of the whole. Godel is evidently thinking in terms of an informal 
notion of structure according to which isomorphic “structures” (in the 
set-theoretic sense) are the same structure. But it is a problem to con¬ 
struct a theory in which this sameness is interpreted as identity. 15 

Godel seems to regard the simple theory of types as the best presently 
available solution to the paradoxes for a theory of concepts, but “such 
a solution may be found ... in the future perhaps in the development of 
the ideas sketched on pages 132 and 150” (140). The former refers to his 
remarks on Russell’s “zig-zag theory”, the latter to the frequently quoted 
but enigmatic suggestion that a concept might be assumed “significant 


j Whether the simple theory of types conflicts with this form of the principle de¬ 
pends on how it is interpreted. It seems clear, for example from the emphasis Godel 
places on the claim (136) that classical mathematics, and in particular Principia with 
reducibility, does not satisfy the first form, that Godel thinks that the extensional 
simple theory of types with its higher-order variables interpreted to range over sets 
does satisfy the second form (and the third as well). Whether an intensional form of 
the simple theory, in which the variables range over properties and relations, satisfies 
the second form will depend on the underlying notion of intension. Godel’s remark 
(139) that “the totality of all properties (or of all those of a given type) does lead 
to situations of this kind”, in which the second form is violated, makes it clear that 
he envisages a notion of property that would lead to an interpretation of the simple 
theory of types where the second form of the principle is violated. Godel’s anno¬ 
tations to reprint A, however, call this into question. In 1944j the violation of the 
second form of the principle is said to arise because a universal quantification over 
properties of a given type contains these properties as constituents of their content 
(139). This is questioned in A on the ground that universal quantification “does not 
mean in the same way as conjunction does.” 

It should be noted that Godel’s first remark in E on Bemays 1946 is “Misun¬ 
derstanding of my interpretation of type theory for concepts.” Presumably he is 
attributing such a misunderstanding to Bemays. I am unable to determine in what 
the misunderstanding consists. 

k In the language of category theory, we could say that an object A is a proper 
part of an object B if there is a monomorphism of A into B that is not epi; this does 
not exclude identity. The alternative, to say that A is a proper part of B if there is a 
monomorphism of A into B but there is not an epimorphism, does of course exclude 
identity but does not fit what Godel says. 

Of course, the relevant difference between the construal of structures as tuples 
of a domain and relations on it, and the language of categories, is that the former 
forces, while the latter does not, a distinction between isomorphism and identity. 
This is a much more superficial difference than the matter of the self-applicability of 
categories (see below). 
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everywhere except for certain ‘singular points’ or ‘limiting points’, so 
that the paradoxes would appear as something analogous to dividing 
by zero.” Evidently he has in mind a type-free theory; he mentions as 
attempts the work of the early 1930s on theories based on the A-calculus 
(in particular Church 1932, 1933), which he views as having had a nega¬ 
tive outcome, in view of Kleene and Rosser 1935. Godel does not return 
to this theme in later publications, except for the brief remark that 
“the spirit of the modern abstract disciplines of mathematics, in partic¬ 
ular the theory of categories, transcends this [iterative! concept of set, 
as becomes apparent, e.g., by the self-applicability of categories” (1964, 
page 262, footnote 12; not in 1947). However, he evidently thought that 
Mac Lane’s distinction between large and small categories captured “the 
mathematical content of the theory” as it then stood. But the program 
of constructing a strong type-free theory has attracted others, with in¬ 
conclusive results so far. 1 

Godel’s remarks about realistic theories of concepts in the present 
paper have an inconclusive character; no available theory satisfies him. 
In later publications, as we have noted, he is virtually silent on the sub¬ 
ject. The question arises whether Godel himself worked on the project 
of constructing a theory that would answer to his conception. Whether 
he did is not known, but the absence of more definite information would 
suggest the conjecture that he never formulated such a theory to his own 
satisfaction. It is to be hoped that transcriptions of Godel’s shorthand 
notebooks will shed light on these questions. 


5. The ramified theory of types 

Returning to Russell, Godel begins his discussion of the ramified the¬ 
ory by remarking on Russell’s “pronounced tendency to build up logic 
as far as possible without the assumption of the objective existence of 
such entities as classes and concepts” (141). He reads the contextual 
definitions of locutions involving classes in Principia as a reduction of 
classes to concepts, m but reasonably enough finds matters not so clear 
when it comes to concepts and propositions. Influenced especially by 


'For a survey see Feferman 1984 and its sequel. 

m Godel’s reading is misleading in that he clearly understands concepts to be ob¬ 
jects, while the “ambiguity” that Russell attributes to propositional functions is close 
to Frege’s “unsaturatedness”. But, from his later comment on the notion of proposi¬ 
tional function (147-148), it is clear that this does not result from misunderstanding 
but is rather a conscious assimilation of Russell’s conceptual scheme to his own. See 
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the introduction to the second edition, Godel finds in Principia a pro¬ 
gram according to which all concepts and propositions except logically 
simple ones are to “appear as something constructed (i.e., as something 
not belonging to the ‘inventory’ of the world)” (142). This program 
offers an intrinsic motivation for the ramification of the type hierarchy, 
but does not yield a theory strong enough for classical mathematics, for 
well-known reasons: the impredicative character of standard arguments 
in analysis, and the question whether, to construct number theory, one 
can replace the Frege Russell definition of the predicate “natural num¬ 
ber” by one in which the second-order quantifier is restricted to a definite 
order (145-146)." 

It is not as clear as it might be how Godel sees the realization of the 
program he attributes to Russell even to obtain ramified type theory 
without reducibility. The introduction to the second edition of Prin¬ 
cipia proceeds on the basis of the Wittgensteinian idea that “functions 
of propositions are always truth functions, and that a function can only 
occur in a proposition through its values” (Whitehead and Russell 1925, 
page xiv). But it is hard to see how propositions involving quantifiers 
are to be interpreted as truth functions of atomic propositions unless 
infinitary propositional combinations are allowed, as Ramsey in effect 
proposed; Godel’s comment on that is that one might as well adopt 
the iterative conception of sets as pluralities (144). Godel says that 
Russell “took a less metaphysical course by confining himself to such 
truth-functions as can actually be constructed” (145). But what are 
the allowed means of construction? Godel apparently has in mind an 
interpretation of the ramified theory in which the higher-order variables 
range over predicates, that is, the linguistic expressions that “express” 
the propositional functions that the quantifiers range over on the naive 
reading. This is indicated by Godel’s later remark that for propositional 
functions to be “defined (as in the second edition of Principia) to be 
certain finite ... combinations (of quantifiers, propositional connectives, 
etc.)” (146) would presuppose the notion of finiteness and therefore 
arithmetic, and by the earlier characterization of the constructivist view 
he is trying to explicate as a form of nominalism (136-137). 

Such an interpretation is certainly possible and well known, provided 
that at the level of individuals one has elementary syntax. But this read¬ 
ing of Godel still leaves some puzzles. Translating a statement about 
propositional functions of order n as one about predicates, namely pred¬ 
icates containing quantifiers only for propositional functions of order 


n Godel leaves the latter question open. A negative answer is claimed without 
proof in Wang 1959 (see 1962 , p. 642). A full treatment, with a proof, is given in 
Myhill 1974. 
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< n, requires the notion of satisfaction, or at least truth, for the lan¬ 
guage with quantifiers of order < n. It is surprising that Godel nowhere 
remarks on this fact, particularly since in this context we would have 
to suppose satisfaction or truth introduced by an inductive definition, 
another obstacle to applying the idea in reducing arithmetic to logic. 
Moreover, such an interpretation entirely eliminates quantification over 
the sort of entities Godel calls concepts, at least in the absence of such 
locutions as propositional attitudes, and therefore does not leave “the 
primitive predicates and relations such as ‘red’ or ‘colder’ ” as “real ob¬ 
jects” (142). But to suppose that Godel would have remarked on the 
latter point if he had had the present interpretation in mind may be 
to attribute to him a Quinean distinction of “ontology” and “ideology” 
that is foreign to him. 

The difficulties faced by this last interpretation suggest to me that 
Godel did not distinguish clearly in his own mind between a nominalist 
theory of concepts in which such entities are eliminated, and a theory in 
which every concept in the range of a quantifier is “signified” by an ex¬ 
pression for it that is antecedently understood, but in which reference to 
concepts is not actually eliminated, because, presumably, one does not 
give a contextual definition of quantifiers over them. The latter sort of 
theory might now be realized by a substitutional interpretation of quan¬ 
tifiers over concepts; given such an interpretation, the most that can be 
asked in establishing the “existence” of a concept is the construction of 
a meaningful expression “signifying” it. 

Something more should be said about Godel’s use of the term “data”. 
In the discussion of the ramified theory it refers to that on the basis of 
which classes and concepts are constructed, or perhaps to what is al¬ 
lowed as primitive in a theory in which reference to classes and concepts 
is eliminated. (See for example note 33, page 142.) Godel does not 
say here what epistemological force this might have. The analogy with 
sense perception may be limited to the context of interpreting Russell, 
who was interested in a program that would represent the objects of 
physics as “logical constructions” from sense-data. In Godel’s own epis¬ 
temological view of mathematics, what corresponds most closely to sense 
perception is something quite different, namely elementary arithmetical 
evidence (see 128). 

Like many other commentators, Godel found that in the first edition 
of Principia the constructivistic attitude was fatally compromised by the 
axiom of reducibility, but his description (143) of what survived is worth 
noting. 0 However, Godel’s remarks about the axiom of reducibility show 


“Godel mentions here the treatment of propositional connectives as applied to 
propositions containing quantifiers, presumably in *9, which he says “proved its 
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lack of sensitivity to the essentially intensional character of Russell’s 
logic; the fact that every propositional function is coextensive with one 
of lowest order does not imply “the existence in the data of the kind of 
objects to be constructed” (141), if the objects in question are concepts 
or propositional functions rather than classes. It is similarly misleading 
to say that, “owing to the axiom of reducibility, there always exist real 
objects in the form of primitive predicates, or combinations of such, cor¬ 
responding to each defined symbol” (143). Russell himself was closer to 
the mark in saying that the axiom accomplishes “what common sense 
effects by the admission of classes''' (1908, page 167 of van Heijenoort 
1967). The ramified theory with reducibility would fit well with a con¬ 
ception according to which classes are admitted as “real objects”, but 
the conception of propositional functions (concepts) is constructivistic. 
This insensitivity is quite common in commentators on Russell, but is 
somewhat surprising in Godel, since his own “concepts” are evidently 
intensions of a kind, and his 1958 shows a very subtle and fruitful han¬ 
dling of intensional notions. 1 * 

Godel concludes his discussion of the ramified theory with well known 
remarks in which he views his own theory of constructible sets as an ex¬ 
tension of the hierarchy of orders, now within the framework of ordinary 
(impredicative) mathematics, to arbitrary transfinite orders (146-147). 
After what I have said about Godel’s treatment of the axiom of re¬ 
ducibility, I should call attention to his characterization of his theorem 
that every constructible set of integers has order < uii as a “transfinite 
theorem of reducibility” q —thus, when set against Russell, a striking ap¬ 
plication of his realistic point of view. 

Godel remarks that even from the predicative standpoint an exten¬ 
sion of the hierarchy of orders is possible and, moreover, demanded by 
the theory. This remark may be the first suggestion of a program that 


fecundity in a consistency proof for arithmetic”. The connection between *9 and Her- 
brand’s work suggests that he has in mind Herbrand 1931, which, however, covered 
only first-order arithmetic with quantifier-free induction. A proof within Herbrand’s 
framework that covers full first-order arithmetic became known only some years later; 
see Dreben and Denton 1970 and Scanlon 1973. Could Godel have seen at this time 
how to extend Herbrand’s proof? The errors in the proof of the fundamental theorem 
in Herbrand 1930 would have been an obstacle. At the time Godel was at least aware 
of a difficulty; see van Heijenoort 1967, p. 525. 

Another possibility, suggested by van Heijenoort, is that Godel was thinking of 
the proof he published in 1958, which he had discovered not long before writing the 
present paper. This now seems to me somewhat more likely. 

p Of course, the theory of intensional equality in 1958 is very different from more 
usual constructions in intensional logic, and in the present paper he suggests that 
concepts might obey extensionality (137). 

q In reprint A, Godel amended the text in a way that omits this phrase. 
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was pursued by several logicians from 1950 on, of extending and ana¬ 
lyzing the resources of predicative mathematics by means of ramified 
theories with transfinite orders. The first work of this kind (Lorenzen 
1951a, 1951b, 1955 ; Wang 1954) sought to give a better reconstruction 
of classical mathematics than earlier predicative work by constructing 
transfinite ramified theories. It left open the question what ordinal lev¬ 
els can be admitted in such a construction. 

Godel already offers a hint in remarking that one can extend the hier¬ 
archy of orders “to such transfinite ordinals as can be constructed within 
the framework of finite orders” (147). It seems evident that such a pro¬ 
cedure might b e iterated; this gives rise to the notion of autonomous 
iteration that is prominent in later analyses of predicativity. The first 
such proposal is made in Wang 1954 : given an interpreted language £ a 
(in his setting, ramified set theory with ordinal levels < a), one extends 
it by admitting as new levels ordinals /? such that a well-ordering of type 
j3 is definable in £ Q (1962, page 579). In this case, since Wang’s the¬ 
ory £0 could define all recursive orderings, the iteration closes after one 
step. This was shown in Spector 1957. Spector constructed a sequence 
of systems similar to Wang’s, indexed by the recursive ordinal notations, 
and showed that any well-ordering definable in one of the systems has 
as order type a recursive ordinal; he also showed that the sets of nat¬ 
ural numbers definable in some system of the sequence are exactly the 
hyperarithmetic sets. 

Kleene’s work on the hyperarithmetic hierarchy related it to a trans¬ 
finite ramified hierarchy, and a number of technical results suggested the 
thesis, advanced with some reservations by G. Kreisel (1960a, page 373), 
that a set of natural numbers is predicatively definable if and only if it 
is hyperarithmetic. Kreisel seems to have been more confident of the 
“only if” than of the “if’ part of this thesis (1960a, pages 387-388; see 
also 1962, page 318, and Feferman 1964, page 10). The main reservation 
about the latter concerned the use in the definition of a hyperarithmetic 
set of the notion of a recursive ordinal; should one not demand, for 
an ordinal to count as predicatively obtained, that an ordering of that 
type be predicatively recognizedto be a well-ordering (1960a, page 387)? r 


'Accepting this demand seems to involve giving up the attempt to characterize 
predicative definability independently of predicative provability. In fact the inde¬ 
pendent discussion of predicative definability seems to have petered out when Fefer- 
man’s and Schiitte’s results became known. 

That a definite meaning can be given to the notion “predicatively definable set 
of natural numbers” without using impredicative concepts seems very doubtful, and 
Kreisel already stressed in 1960a that he was approaching the analysis of predicativ¬ 
ity with the help of impredicative notions, that of a well-ordering in particular. (See 
also Lorenzen’s reasons for rejecting the question how far his iterative construction 
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This question leads into the analysis of predicative provability, in which 
another type of autonomous iteration (first suggested in Kreisel 1960) 
plays a role. In the context of ramified theories, one constructs a progres¬ 
sion of theories (in the precise sense of Feferman 1962) such that levels 
up to an ordinal given by a primitive recursive ordering are admitted 
if at an earlier stage that ordering has been proved to be well-founded. 
This led to a precise characterization of the predicatively provable state¬ 
ments of ramified analysis ( Feferman 1964 ; Schiitte 1965, 1965a). The 
same idea is applied to the admission of stages in the progression of the¬ 
ories, and it is therefore applicable to unramified theories, which allowed 
Feferman to extend his characterization to the usual language of anal¬ 
ysis {1964) and to set theory {1966, 1974). The results of Feferman 
and Schiitte all point to the conclusion that an ordering that can be 
predicatively proved to be a well-ordering is of type less than a certain 
recursive ordinal To, the first strongly critical number. 8 


6. The simple theory of types 

What is of greatest interest in Godel’s discussion of the simple theory 
is his questioning of it as a theory of concepts and the hint for a possible 
type-free theory commented on above (§4). It should be noted that he 
motivates his suggestion by means of Russell’s idea that propositional 
functions have limited ranges of significance (149-150). 


7. Analyticity 

Godel now turns to “the question whether (and in which sense) the 
axioms of Principia can be considered to be analytic”. In a first sense— 
roughly, reducibility by explicit or contextual definitions to instances of 
the law of identity—he says that even arithmetic is demonstrably non- 
analytic because of its undecidability. This sense is of interest because it 
seems to be directly inspired by Leibniz. If infinite reduction, with inter¬ 
mediary sentences of infinite length, is allowed (as would be suggested 
by Leibniz’s theory of contingent propositions), then all the axioms of 
Principia can be proved analytic, but the proof would require “the whole 
of mathematics ... e.g., the axiom of choice can be proved to be analytic 


of language strata can be carried, 1955, p. 189.) Kreisel’s attitude was in line with 
Godel’s realism and may have been influenced by it. But it should be noted that the 
notions involved can still be understood constructively. 

s Feferman 1964, part II; see also Schiitte 1977, chapter VIII. 
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only if it is assumed to be true” (151). This remark anticipates later 
arguments criticizing the thesis that mathematics is analytic, such as 
Quine 1960. Godel continues: “In a second sense a proposition is called 
analytic if it holds ‘owing to the meaning of the concepts occurring in 
it’, where this meaning may perhaps be undefinable (i.e., irreducible 
to anything more fundamental).” In this sense, Godel affirms the an- 
alyticity of the axioms of the first edition of Principia other than the 
axiom of infinity for two interpretations, “namely if the term ‘predica¬ 
tive function’ is replaced either by ‘class’ (in the extensional sense) or 
(leaving out the axiom of choice) by ‘concept’” (151). The first is the 
sort of interpretation suggested by remarks of Russell such as that in 
1908 quoted above. 4 The second prompts the remark that “the meaning 
of the term ‘concept’ seems to imply that every propositional function 
defines a concept”. Godel’s intuitive notion of concept seems in that 
respect to have resembled Frege’s notion of extension." 

For analyticity in this sense, Godel sees the difficulty that “we don’t 
perceive the concepts of ‘concept’ and ‘class’ with sufficient distinctness, 
as is shown by the paradoxes” (151). But, rather than following Russell’s 
reductionism, the actual development of logic (even by Russell in much 
of his work) has consisted in “trying to make the meaning of the terms 
‘class’ and ‘concept’ clearer, and to set up a consistent theory of classes 
and concepts as objectively existing entities” (152). In spite of the suc¬ 
cess of the simple theory of types and of axiomatic set theory, “many 
symptoms show only too clearly, however, that the primitive concepts 
need further elucidation”. But surely Godel must have seen matters 
differently with respect to the two notions of class and concept: in the 
former, he seems to insist in 1947, there is a well-motivated theory that 
is quite satisfactory as far as it goes; what is lacking in our “perception” 
of the notion of set is intuition regarding the truth of axioms that would 
decide such questions as the continuum problem. In the latter, he had 
no theory to offer that answered to his intuitive notion, and it does not 
appear that such a theory has been constructed since. 


‘Godel’s remarks about the axiom of reducibility commented on above might 
suggest that he had in mind an extensional interpretation that would collapse the 
ramification of the hierarchy. But that in the absence of extensionality such collapse 
does not occur is made clear in Church 1976. It is straightforward to construct 
possible-worlds models of Church’s formulation where reducibility holds but orders 
do not collapse. 

“Except for Frege’s commitment to an extensional language, Godel’s “concepts” 
closely resemble the objects that Frege says are signified by such phrases as “the 
concept horse”. In the Fregean context, they are hardly distinguishable from the 
extensions. Indeed, Godel’s suggestion (l50; see above) that one might assume a 
concept significant everywhere except for certain “singular points” recalls Frege’s 
unsuccessful proposal for a way out of the paradoxes. (See Frege 1903, p. 262, and 
Quine 1955.) 
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The seriousness with which Godel takes the notion of analyticity in 
this section has not attracted the attention of commentators. It seems 
to show a greater engagement with the ideas of the Vienna Circle than 
has usually been attributed to Godel, although there is no doubt that 
the disagreement is deep. With the Vienna Circle and many other ana¬ 
lytic philosophers before the impact of Quine’s criticism, Godel believes 
firmly that mathematical propositions are true by virtue of “the mean¬ 
ing of the concepts occurring” in them, though others might have said 
“words” instead of “concepts”, but he denies that mathematics is true 
by convention (as perhaps it would be if it were analytic in his first sense) 
or that its truth is constituted by linguistic rules that we lay down or 
embody in our usage. v Thus, he says, this position does not contradict 
his view that “mathematics is based on axioms with a real content”, 
since the existence of the concepts involved would have to be an axiom 
of this kind (151, note 47), presumably if one were undertaking to de¬ 
rive the truth of the axioms from their being in some way implied by 
the concepts. w 


8. Concluding remarks 

A sort of coda to this intricate paper is formed by Godel’s closing re¬ 
marks noting that mathematical logic had not yet come close to fulfilling 
the hopes of “Peano and others” that it would contribute to the solu¬ 
tion of problems in mathematics, and attributing this to “incomplete 
understanding of the foundations”: “For how can one hope to solve 
mathematical problems by mere analysis of the concepts occurring, if 
our analysis so far does not even suffice to set up the axioms?” (152). 
His suggestion that the hopes expressed by Leibniz for his characteris- 
tica universalis might, after all, be realistic is one of his most striking 


v Godel’s annotation in reprint A to note 47 seems to reject the possibility that by 
virtue of meaning every mathematical proposition can be “reduced to a special case 
of a = a". This thesis is mentioned in the remarks in E on Bemays 1946. Bernays ap¬ 
pears to argue against it, using Frege’s distinction of sense and signification (Godel’s 
translation of Bedeutung ): since the transformations that reduce a mathematical 
proposition to an identity will in general not preserve sense, the “meaning” by virtue 
of which the reduction proceeds can only be signification 11946, p. 78). On the latter 
reading the thesis reduces to triviality. 

There is no way of knowing whether this argument influenced Godel. It seems to 
me to be unconvincing. That P can be reduced to Q by virtue of the sense of P need 
not imply that the sense of Q is the same as that of P. 

w Godel evidently elaborated further on this issue in the unfinished paper, “Is 
mathematics syntax of language?” undertaken in the mid-1950s for Sckilpp 1963. 
(See Wang 1981, p. 658.) From a very brief examination, it seems to reinforce the 
points of agreement and disagreement with the Vienna Circle noted in the text. 
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and enigmatic utterances. It is known that Godel was for a time much 
occupied with the study of Leibniz and that he regarded Leibniz as 
the greatest influence on him of the philosophers of the past. But the 
strongly Leibnizian flavor of the last pages of the present paper recedes 
in his later writings (except perhaps 1946), and about the substance of 
his reflections on Leibniz little is known. x 


Charles Parsons y 


x But see also Wang 1981, p. 657, n. 8. 

y I am much indebted to John W. Dawson, Jr., for assistance, and to Burton 
Dreben, Wilfried Sieg, Hao Wang, and the editors (especially Solomon Feferman) for 
comments and suggestions. Without the work of the late Jean van Heijenoort on 
Godel’s annotations to A-E, my own brief comments on them would not have been 
possible. 



Russell’s mathematical logic* 

(1944) 


Mathematical logic, which is nothing else but a precise and complete 
formulation of formal logic, has two quite different aspects. On the one 
hand, it is a section of mathematics treating of classes, relations, combina¬ 
tions of symbols, etc., instead of numbers, functions, geometric figures, 
etc. On the other hand, it is a science prior to all others, which contains 
the ideas and principles underlying all sciences. It was in this second sense 
that mathematical logic was first conceived by Leibniz in his Characteristica 
universalis, of which it would have formed a central part. But it was almost 
two centuries after his death before his idea of a logical calculus really 
sufficient for the kind of reasoning occurring in the exact sciences was put 
into effect (in some form at least, if not the one Leibniz had in mind) by 
Frege and Peano. 1 Frege was chiefly interested in the analysis of thought 
and used his calculus in the first place for deriving arithmetic from pure 
logic. Peano, on the other hand, was more interested in its applications 
within mathematics and created an elegant and flexible symbolism, which 
permits expressing even the most complicated mathematical theorems in a 
perfectly precise and often very concise manner by single formulas. 

It was in this line of thought of Frege and Peano that Russell’s work 
set in. Frege, in consequence of his painstaking analysis of the proofs, had 
not gotten beyond the most elementary properties of the series of integers, 
while Peano had accomplished a big collection of mathematical theorems 
expressed | in the new symbolism, but without proofs. It was only in Prin- 126 
cipia mathematica that full use was made of the new method for actually 
deriving large parts of mathematics from a very few logical concepts and 


*[Author’s addition of 1964, expanded in 1972'. The author wishes to note (1) that 
since the original publication of this paper advances have been made in some of the 
problems discussed and that the formulations given could be improved in several places, 
and (2) that the term “constructivistic” in this paper is used for a strictly nominal¬ 
istic kind of constructivism, such as that embodied in Russell’s “no class theory”. Its 
meaning, therefore, is very different from that used in current discussions on the founda¬ 
tions of mathematics, i.e., from both “intuitionistically admissible” and “constructive” 
in the sense of the Hilbert School. Both these schools base their constructions on a 
mathematical intuition whose avoidance is exactly one of the principal aims of Russell’s 
constructivism (see the first alternative in the last sentence of footnote 23 below). What, 
in Russell’s own opinion, can be obtained by his constructivism (which might better be 
called fictionalism) is the system of finite orders of the ramified hierarchy without the 
axiom of infinity for individuals. The explanation of the term constructive given in 
footnote 22 below is to be replaced by the remarks just made.] 

1 Frege has doubtless the priority, since his first publication about the subject, which 
already contains all the essentials, appeared ten years before Peano’s. 
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axioms. In addition, the young science was enriched by a new instrument, 
the abstract theory of relations. The calculus of relations had been de¬ 
veloped before by Peirce and Schroder, but only with certain restrictions 
and in too close analogy with the algebra of numbers. In Principia not 
only Cantor’s set theory but also ordinary arithmetic and the theory of 
measurement are treated from this abstract relational standpoint. 

It is to be regretted that this first comprehensive and thorough-going 
presentation of a mathematical logic and the derivation of mathematics 
from it so greatly lacking in formal precision in the foundations (contained 
in *1-*21 of Principia ) that it presents in this respect a considerable step 
backwards as compared with Frege. What is missing, above all, is a precise 
statement of the syntax of the formalism. Syntactical considerations are 
omitted even in cases where they are necessary for the cogency of the proofs, 
in particular in connection with the “incomplete symbols”. These are in¬ 
troduced not by explicit definitions, but by rules describing how sentences 
containing them are to be translated into sentences not containing them. 
In order to be sure, however, that (or for what expressions) this transla¬ 
tion is possible and uniquely determined and that (or to what extent) the 
rules of inference apply also the new kind of expressions, it is necessary to 
have a survey of all possible expressions, and this can be furnished only by 
syntactical considerations. The matter is especially doubtful for the rule 
of substitution and of replacing defined symbols by their definiens. If this 
latter rule is applied to expressions containing other defined symbols it re¬ 
quires that the order of elimination of these be indifferent. This however is 
by no means always the case = u[$!u], e.g., is a counter-example). In 
Principia such eliminations are always carried out by substitutions in the 
theorems corresponding to the definitions, so that it is chiefly the rule of 
substitution which would have to be proved. 

127 I do not want, however, to go into any more details about | either the 
formalism or the mathematical content of Principia , 2 but want to devote 
the subsequent portion of this essay to Russell’s work concerning the analy¬ 
sis of the concepts and axioms underlying mathematical logic. In this field 
Russell had produced a great number of interesting ideas some of which 
are presented most clearly (or are contained only) in his earlier writings. I 
shall therefore frequently refer also to these earlier writings, although their 
content may partly disagree with Russell’s present standpoint. 

What strikes one as surprising in this field is Russell’s pronouncedly 
realistic attitude, which manifests itself in many passages of his writings. 
“Logic is concerned with the real world just as truly as zoology, though with 
its more abstract and general features”, he says, e.g., in his Introduction to 
mathematical philosophy (edition of 1920, page 169). It is true, however, 


J Cf. in this respect Quine 1941■ 
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that this attitude has been gradually decreasing in the course of time 3 
and also that it always was stronger in theory than in practice. When 
he started on a concrete problem, the objects to be analyzed (e.g., the 
classes or propositions) soon for the most part turned into “logical fictions”. 
Though perhaps this need not necessarily mean (according to the sense in 
which Russell uses this term) that these things do not exist, but only that 
we have no direct perception of them. 

The analogy between mathematics and a natural science is enlarged 
upon by Russell also in another respect (in one of his earlier writings). 
He compares the axioms of logic and mathematics with the laws of nature 
and logical evidence with sense perception, so that the axioms need not 
necessarily be evident in themselves, but rather their justification lies (ex¬ 
actly as in physics) in the fact that they make it possible for these “sense 
perceptions” to be deduced; which of course would not exclude that they 
also have a kind of intrinsic plausibility similar to that in physics. I think 
that (provided “evidence” is understood in a sufficiently strict sense) this 
view has been largely justified by subsequent developments, and it is to 
be expected that it will be still more so in the future. It has turned out 
that (under the | assumption that modern mathematics is consistent) the 
solution of certain arithmetical problems requires the use of assumptions 
essentially transcending arithmetic, i.e., the domain of the kind of elemen¬ 
tary indisputable evidence that may be most fittingly compared with sense 
perception. Furthermore it seems likely that for deciding certain questions 
of abstract set theory and even for certain related questions of the theory 
of real numbers new axioms based on some hitherto unknown idea will be 
necessary. Perhaps also the apparently unsurmountable difficulties which 
some other mathematical problems have been presenting for many years 
are due to the fact that the necessary axioms have not yet been found. Of 
course, under these circumstances mathematics may lose a good deal of its 
“absolute certainty”; but, under the influence of the modern criticism of 
the foundations, this has already happened to a large extent. There is some 
resemblance between this conception of Russell and Hilbert’s “supplement¬ 
ing the data of mathematical intuition” by such axioms as, e.g., the law 
of excluded middle which are not given by intuition according to Hilbert’s 
view; the borderline, however, between data and assumptions would seem 
to lie in different places according to whether we follow Hilbert or Russell. 

An interesting example of Russell’s analysis of the fundamental logical 
concepts is his treatment of the definite article “the”. The problem is: 
what do the so-called descriptive phrases (i.e., phrases as, e.g., “the author 


3 The above quoted passage was left out in the later editions of the Introduction. 
[Blackwell (1976) has observed that Godel was mistaken on this factual matter.] 
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of Waverley” or “the king of England”) denote or signify 4 and what is the 
meaning of sentences in which they occur? The apparently obvious answer 
that, e.g., “the author of Waverletf’ signifies Walter Scott, leads to unex¬ 
pected difficulties. For, if we admit the further apparently obvious axiom, 
that the signification of a composite expression, containing constituents 
which have themselves a signification, depends only on the signification 
of these constituents (not on the manner in which this signification is ex¬ 
pressed), then it follows that the sentence “Scott is the author of Waverley'' 
129 signifies the same thing as “Scott is Scott”; and this again leads | almost 
inevitably to the conclusion that all true sentences have the same significa¬ 
tion (as well as all false ones). 5 Frege actually drew this conclusion; and he 
meant it in an almost metaphysical sense, reminding one somewhat of the 
Eleatic doctrine of the “One”. “The True”—according to Frege’s view—is 
analyzed by us in different ways in different propositions, “the True” being 
the name he uses for the common signification of all true propositions. 6 

Now, according to Russell, what corresponds to sentences in the outer 
world is facts. However, he avoids the term “signify” or “denote” and 
uses “indicate” instead (in his earlier papers he uses “express” or “being 
a symbol for”), because he holds that the relation between a sentence and 
a fact is quite different from that of a name to the thing named. Fur¬ 
thermore, he uses “denote” (instead of “signify”) for the relation between 
things and names, so that “denote” and “indicate” together would corre¬ 
spond to Frege’s “bedeuten”. So, according to Russell’s terminology and 
view, true sentences “indicate” facts and, correspondingly, false ones in¬ 
dicate nothing. 7 Hence Frege’s theory would in a sense apply to false 
sentences, since they all indicate the same thing, namely nothing. But 


4 I use the term “signify” in the sequel because it corresponds to the German word 
“bedeuten which Frege, who first treated the question under consideration, used in this 
connection. 

5 The only further assumptions one would need in order to obtain a rigorous proof 
would be (1) that “<f>(a)” and the proposition “a is the object which has the property <j> 
and is identical with a” means the same thing and (2) that every proposition “speaks 
about something”, i.e., can be brought to the form 4>(a). Furthermore one would have 
to use the fact that for any two objects a, b, there exists a true proposition of the form 
<j>{a, b) as, e.g., a -£ b or a = a.b = 6. 

6 Cf. Frege 1892, p. 35. 

7 From the indication ( Bedeutung ) of a sentence is to be distinguished what Frege 
called its meaning (Sinn), which is the conceptual correlate of the objectively exist¬ 
ing fact (or “the True”). This one should expect to be in Russell’s theory a possible 
fact (or rather the possibility of a fact), which would exist also in the case of a false 
proposition. But Russell, as he says, could never believe that such “curious shadowy” 
things really exist. Thirdly, there is also the psychological correlate of the fact which 
is called “signification” and understood to be the corresponding belief in Russell’s lat¬ 
est book An inquiry into meaning and truth [1940]. “Sentence”, in contradistinction to 
“proposition”, is used to denote the mere combination of symbols. 
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different true sentences may indicate many different things. Therefore this 
view concerning sentences makes it necessary either to drop the above men¬ 
tioned principle about the signification (i.e., in Rus|selTs terminology the 130 
corresponding one about the denotation and indication) of composite ex¬ 
pressions or to deny that a descriptive phrase denotes the object described. 
Russell did the latter 8 by taking the viewpoint that a descriptive phrase 
denotes nothing at all but has meaning only in context; for example, the 
sentence “the author of Waverley is Scotch” is defined to mean: “There 
exists exactly one entity who wrote Waverley and whoever wrote Waverley 
is Scotch.” This means that a sentence involving the phrase “the author of 
Waverletf does not (strictly speaking) assert anything about Scott (since 
it contains no constituent denoting Scott), but is only a roundabout way of 
asserting something about the concepts occurring in the descriptive phrase. 
Russell adduces chiefly two arguments in favor of this view, namely (1) 
that a descriptive phrase may be meaningfully employed even if the ob¬ 
ject described does not exist (e.g., in the sentence: “The present king of 
Prance does not exist.”); (2) that one may very well understand a sentence 
containing a descriptive phrase without being acquainted with the object 
described, whereas it seems impossible to understand a sentence without 
being acquainted with the objects about which something is being asserted. 

The fact that Russell does not consider this whole question of the interpre¬ 
tation of descriptions as a matter of mere linguistic conventions, but rather 
as a question of right and wrong, is another example of his realistic atti¬ 
tude, unless perhaps he was aiming at a merely psychological investigation 
of the actual processes of thought. As to the question in the logical sense, I 
cannot help feeling that the problem raised by Frege’s puzzling conclusion 
has only been evaded by Russell’s theory of descriptions and that there is 
something behind it which is not yet completely understood. 

There seems to be one purely formal respect in which one may give 
preference to Russell’s theory of descriptions. By defining the meaning 
of sentences involving descriptions in the above manner, he avoids in his 
logical system any axioms about the particle “the”, i.e., the analyticity of 
the theorems about “the” is made explicit; they can be shown to follow 
from | the explicit definition of the meaning of sentences involving “the”. 131 
Frege, on the contrary, has to assume an axiom about “the”, which of 
course is also analytic, but only in the implicit sense that it follows from 
the meaning of the undefined terms. Closer examination, however, shows 
that this advantage of Russell’s theory over Frege’s subsists only as long 
as one interprets definitions as mere typographical abbreviations, not as 


8 He made no explicit statement about the former; but it seems it would hold for the 
logical system of Principia, though perhaps more or less vacuously. 
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introducing names for objects described by the definitions, a feature which 
is common to Frege and Russell. 

I pass now to the most important of Russell’s investigations in the field 
of the analysis of the concepts of formal logic, namely those concerning the 
logical paradoxes and their solution. By analyzing the paradoxes to which 
Cantor’s set theory had led, he freed them from all mathematical technicali¬ 
ties, thus bringing to light the amazing fact that our logical intuitions (i.e., 
intuitions concerning such notions as: truth, concept, being, class, etc.) 
are self-contradictory. He then investigated where and how these common- 
sense assumptions of logic are to be corrected and came to the conclusion 
that the erroneous axiom consists in assuming that for every propositional 
function there exists the class of objects satisfying it, or that every proposi¬ 
tional function exists “as a separate entity” ; 9 by which is meant something 
separable from the argument (the idea being that propositional functions 
are abstracted from propositions which are primarily given) and also some¬ 
thing distinct from the combination of symbols expressing the propositional 
function; it is then what one may call the notion or concept defined by it. 10 
The existence of this concept already suffices for the paradoxes in their 
132 “intensional” form, where the concept of | “not applying to itself” takes 
the place of Russell’s paradoxical class. 

Rejecting the existence of a class or concept in general, it remains to 
determine under what further hypotheses (concerning the propositional 
function), these entities do exist. Russell pointed out (loc. cit.) two possible 
directions in which one may look for such a criterion, which he called 
the zig-zag theory and the theory of limitation of size, respectively, and 
which might perhaps more significantly be called the intensional and the 
extensional theory. The second one would make the existence of a class or 
concept depend on the extension of the propositional function (requiring 
that it be not too big), the first one on its content or meaning (requiring a 
certain kind of “simplicity”, the precise formulation of which would be the 
problem). 

The most characteristic feature of the second (as opposed to the first) 
would consist in the non-existence of the universal class or (in the inten¬ 
sional interpretation) of the notion of “something” in an unrestricted sense. 


9 In Russell’s first paper about the subject (1906). If one wants to bring such para¬ 
doxes as “the liar” under his viewpoint, universal (and existential) propositions must be 
considered to involve the class of objects to which they refer. 

10 “Propositional function” (without the clause “as a separate entity”) may be un¬ 
derstood to mean a proposition in which one or several constituents are designated as 
arguments. One might think that the pair consisting of the proposition and the argu¬ 
ment could then for all purposes play the role of the “propositional function as a separate 
entity”, but it is to be noted that this pair (as one entity) is again a set or a concept 
and therefore need not exist. 
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Axiomatic set theory as later developed by Zermelo and others can be 
considered as an elaboration of this idea as far as classes are concerned. 11 
In particular the phrase “not too big” can be specified (as was shown by J. 
von Neumann 12 ) to mean: not equivalent with the universe of all things, 
or to be more exact, a propositional function can be assumed to determine 
a class when and only when there exists no relation (in intension, i.e., a 
propositional function with two variables) which associates, in a one-to-one 
manner with each object, an object satisfying the propositional function 
and vice versa. This criterion, however, does not appear as the basis of the 
theory but as a consequence of the axioms and inversely can replace two 
of the axioms (the axiom of replacement and that of choice). 

For the second of Russell’s suggestions too, i.e., for the zig-zag theory, 
there has recently been set up a logical system which shares some essential 
features with this scheme, namely, | Quine’s system. 13 It is, moreover, not 133 
unlikely that there are other interesting possibilities along these lines. 

Russell’s own subsequent work concerning the solution of the paradoxes 
did not go in either of the two afore-mentioned directions pointed out by 
himself, but was largely based on a more radical idea, the “no-class theory”, 
according to which classes or concepts never exist as real objects, and 
sentences containing these terms sure meaningful only to such an extent as 
they can be interpreted as a fagon de parler, a manner of speaking about 
other things (cf. page 141). Since in Principia and elsewhere, however, he 
formulated certain principles discovered in the course of the development of 
this theory as general logical principles without mentioning any longer their 
dependence on the no-class theory, I am going to treat of these principles 
first. 

I mean in particular the vicious circle principle, which forbids a certain 
kind of “circularity” which is made responsible for the paradoxes. The 
fallacy in these, so it is contended, consists in the circumstance that one 
defines (or tacitly assumes) totalities, whose existence would entail the 
existence of certain new elements of the same totality, namely elements 
definable only in terms of the whole totality. This led to the formulation 
of a principle which says that “no totality can contain members definable 
only in terms of this totality, or members involving or presupposing this 
totality” (vicious circle principle). In order to make this principle applica¬ 
ble to the intensional paradoxes, still another principle had to be assumed, 
namely that “every propositional function presupposes the totality of its 


11 The intensional paradoxes can be dealt with e.g. by the theory of simple types or 
the ramified hierarchy, which do not involve any undesirable restrictions if applied to 
concepts only and not to sets. 

12 Cf. von Neumann 1929. 

13 Cf. Quine 1937. 
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values” and therefore evidently also the totality of its possible arguments. 14 
(Otherwise the concept of “not applying to itself’ would presuppose no to¬ 
tality (since it involves no quantifications), 15 and the vicious circle principle 
would not prevent its application to itself.) A corresponding vicious circle 

134 principle | for propositional functions which says that nothing defined in 
terms of a propositional function can be a possible argument of this func¬ 
tion is then a consequence. 16 The logical system to which one is led on the 
basis of these principles is the theory of orders in the form adopted, e.g., in 
the first edition of Principia , according to which a propositional function 
which either contains quantifications referring to propositional functions of 
order n or can be meaningfully asserted of propositional functions of order 
n is at least of order n + 1, and the range of significance of a propositional 
function as well as the range of a quantifier must always be confined to a 
definite order. 

In the second edition of Principia , however, it is stated in the Introduc¬ 
tion (pages xl and xli) that “in a limited sense” also functions of a higher 
order than the predicate itself (therefore also functions defined in terms 
of the predicate as, e.g., in p‘n e k) can appear as arguments of a predi¬ 
cate of functions; and in Appendix B such things occur constantly. This 
means that the vicious circle principle for propositional functions is virtu¬ 
ally dropped. This change is connected with the new axiom that functions 
can occur in propositions only “through their values”, i.e., extensionally, 
which has the consequence that any propositional function can take as an 
argument any function of appropriate type, whose extension is defined (no 
matter what order of quantifiers is used in the definition of this extension). 
There is no doubt that these things are quite unobjectionable even from 
the constructive standpoint (see page 136), provided that quantifiers are 
always restricted to definite orders. The paradoxes are avoided by the the- 

135 ory of simple types, 17 which in | Principia is combined with the theory of 


14 Cf. Whitehead and Russell 1925, p. 39. 

15 Quantifiers are the two symbols (3a:) and (x), meaning respectively “there exists 
an object x ” and “for all objects x". The totality of objects x to which they refer is 
called their range. 

16 Cf. Whitehead and Russell 1925, p. 47, section IV. 

17 By the theory of simple types I mean the doctrine which says that the objects of 
thought (or, in another interpretation, the symbolic expressions) are divided into types, 
namely: individuals, properties of individuals, relations between individuals, properties 
of such relations, etc. (with a similar hierarchy for extensions), and that sentences of 
the form: “a has the property </>”, “6 bears the relation R to c”, etc. are meaningless, 
if a, b, c, R, 4> are not of types fitting together. Mixed types (such as classes containing 
individuals and classes as elements) and therefore also transfinite types (such as the class 
of all classes of finite types) are excluded. That the theory of simple types suffices for 
avoiding also the epistemological paradoxes is shown by a closer analysis of these. (Cf. 
Ramsey 1926 and Tarski 1935, p. 399. 
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orders (giving as a result the “ramified hierarchy”) but is entirely indepen¬ 
dent of it and has nothing to do with the vicious circle principle (cf. page 
147). 

Now as to the vicious circle principle proper, as formulated on page 133, 
it is first to be remarked that, corresponding to the phrases “definable only 
in terms of”, “involving”, and “presupposing”, we have really three differ¬ 
ent principles, the second and third being much more plausible than the 
first. It is the first form which is of particular interest, because only this 
one makes impredicative definitions 18 impossible and thereby destroys the 
derivation of mathematics from logic, effected by Dedekind and Frege, and 
a good deal of modern mathematics itself. It is demonstrable that the for¬ 
malism of classical mathematics does not satisfy the vicious circle principle 
in its first form, since the axioms imply the existence of real numbers defin¬ 
able in this formalism only by reference to all real numbers. Since classical 
mathematics can be built up on the basis of Principia (including the axiom 
of reducibility), it follows that even Principia (in the first edition) does not 
satisfy the vicious circle principle in the first form, if “definable” means 
“definable within the system” and no methods of defining outside the sys¬ 
tem (or outside other systems of classical mathematics) are known except 
such as involve still more comprehensive totalities than those occurring in 
the systems. 

I would consider this rather as a proof that the vicious circle principle 
is false than that classical mathematics is false, and this is indeed plausi¬ 
ble also on its own account. For, first of all one may, on good grounds, 
deny that reference to a totality necessarily implies reference to all single 
elements of it or, in other words, that “all” means the same as an infi¬ 
nite logical | conjunction. One may, e.g., follow Langford’s and Carnap’s 19 
suggestion to interpret “all” as meaning analyticity or necessity or demon- 
strability. There are difficulties in this view; but there is no doubt that in 
this way the circularity of impredicative definitions disappears. 

Secondly, however, even if “all” means an infinite conjunction, it seems 
that the vicious circle principle in its first form applies only if the entities 
involved are constructed by ourselves. In this case there must clearly exist a 
definition (namely the description of the construction) which does not refer 
to a totality to which the object defined belongs, because the construction 
of a thing can certainly not be based on a totality of things to which 
the thing to be constructed itself belongs. If, however, it is a question of 


18 These are definitions of an object a by reference to a totality to which a itself (and 
perhaps also things definable only in terms of a) belong. As, e.g., if one defines a class a 
as the intersection of all classes satisfying a certain condition <f> and then concludes that 
a is a subset also of such classes u as defined in terms of a (provided they satisfy </>). 

19 See Carnap 1931, p. 103, and 1937, p. 162, and Langford 1927, p. 599. 
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objects that exist independently of our constructions, there is nothing in 
the least absurd in the existence of totalities containing members which 
can be described (i.e., uniquely characterized) 20 only by reference to this 
totality. 21 Such a state of affairs would not even contradict the second form 
of the vicious circle principle, since one cannot say that an object described 
by reference to a totality “involves” this totality, although the description 
itself does; nor would it contradict the third form, if “presuppose” means 
“presuppose for the existence” not “for the knowability”. 

So it seems that the vicious circle principle in its first form applies only 
if one takes the constructivistic (or nominalistic) standpoint 22 toward the 
objects of logic and mathematics, in particular toward propositions, classes 
and notions, e.g., if one understands by a notion a symbol together with 
a rule for translating sentences containing the symbol into such sentences 
137 as do | not contain it, so that a separate object denoted by the symbol 
appears as a mere fiction. 23 

Classes and concepts may, however, also be conceived as real objects, 
namely classes as “pluralities of things” or as structures consisting of a 
plurality of things and concepts as the properties and relations of things 
existing independently of our definitions and constructions. 

It seems to me that the assumption of such objects is quite as legitimate 
as the assumption of physical bodies and there is quite as much reason to 
believe in their existence. They are in the same sense necessary to obtain 
a satisfactory system of mathematics as physical bodies are necessary for a 
satisfactory theory of our sense perceptions and in both cases it is impossi¬ 
ble to interpret the propositions one wants to assert about these entities as 
propositions about the “data”, i.e., in the latter case the actually occurring 
sense perceptions. Russell himself concludes in the last chapter of his book 
on Meaning and truth [1940J, though “with hesitation”, that there exist 
“universals”, but apparently he wants to confine this statement to concepts 
of sense perceptions, which does not help the logician. I shall use the term 
“concept” in the sequel exclusively in this objective sense. One formal 
difference between the two conceptions of notions would be that any two 
different definitions of the form a(x) = <j>{x) can be assumed to define two 


20 An object a is said to be described by a propositional function <j>{x) if <f>{x) is true 
for x = a and for no other object. 

21 Cf. Ramsey 1926. 

22 I shall use in the sequel “constructivism” as a general term comprising both these 
standpoints and also such tendencies as are embodied in Russell’s “no class” theory. 

23 One might think that this conception of notions is impossible, because the sen¬ 
tences into which one translates must also contain notions so that one would get into 
an infinite regress. This, however, does not preclude the possibility of maintaining the 
above viewpoint for all the more abstract notions, such as those of the second and higher 
types, or in fact for all notions except the primitive terms which might be only a very few. 
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different notions a in the constructivistic sense. (In particular this would 
be the case for the nominalistic interpretation of the term “notion” sug¬ 
gested above, since two such definitions give different rules of translation 
for propositions containing a.) For concepts, on the contrary, this is by no 
means the case, since the same thing may be described in different ways. 
It might even be that the axiom of extensionality 24 or at least something 
near to it holds for | concepts. The difference may be illustrated by the 
following definition of the number two: “Two is the notion under which 
fall all pairs and nothing else.” There is certainly more than one notion in 
the constructivistic sense satisfying this condition, but there might be one 
common “form” or “nature” of all pairs. 

Since the vicious circle principle, in its first form, does apply to con¬ 
structed entities, impredicative definitions and the totality of all notions or 
classes or propositions are inadmissible in constructivistic logic. What an 
impredicative definition would require is to construct a notion by a combi¬ 
nation of a set of notions to which the notion to be formed itself belongs. 
Hence if one tries to effect a retranslation of a sentence containing a symbol 
for such an impredicatively defined notion it turns out that what one ob¬ 
tains will again contain a symbol for the notion in question. 25 At least this 
is so if “all” means an infinite conjunction; but Carnap’s and Langford’s 
idea (mentioned on page 136) would not help in this connection, because 
“demonstrability”, if introduced in a manner compatible with the construc¬ 
tivistic standpoint towards notions, would have to be split into a hierarchy 
of orders, which would prevent one from obtaining the desired results. 26 As 
Chwistek has shown, 27 it is even possible under certain assumptions ad¬ 
missible within constructivistic logic to derive an actual contradiction from 
the unrestricted admission of impredicative definitions. To be more spe¬ 
cific, he has shown that the system of simple types becomes contradictory 
if one adds the “axiom of intensionality” which says (roughly speaking) 
that to different definitions belong different notions. This axiom, however, 
as has just been pointed out, can be assumed to hold for notions in the 
constructivistic sense. 

Speaking of concepts, the aspect of the question is changed completely. 
Since concepts are supposed to exist objectively, there seems to be objec¬ 
tion neither to speaking of all of them | (cf. page 143) nor to describing 


24 I.e., that no two different properties belong to exactly the same things, which, in a 
sense, is a counterpart to Leibniz’s Principium identitatis indiscernibilium, which says 
no two different things have exactly the same properties. 

25 Cf. Carnap 1931, p. 103, and 1937, p. 162. 

26 Nevertheless the scheme is interesting because it again shows the constructibility 
of notions which can be meaningfully asserted of notions of arbitrarily high order. 

27 See Chwistek 1933. 
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some of them by reference to all (or at least all of a given type). But, 
one may ask, isn’t this view refutable also for concepts because it leads to 
the “absurdity” that there will exist properties <j) such that </>(a) consists 
in a certain state of affairs involving all properties (including <j> itself and 
properties defined in terms of (j>), which would mean that the vicious circle 
principle does not hold even in its second form for concepts or proposi¬ 
tions? There is no doubt that the totality of all properties (or of all those 
of a given type) does lead to situations of this kind, but I don’t think they 
contain any absurdity. 28 It is true that such properties <j> (or such proposi¬ 
tions (j>(a)) will have to contain themselves as constituents of their content 
(or of their meaning), and in fact in many ways, because of the proper¬ 
ties defined in terms of <j>] but this only makes it impossible to construct 
their meaning (i.e., explain it as an assertion about sense perceptions or 
any other non-conceptual entities), which is no objection for one who takes 
the realistic standpoint. Nor is it self-contradictory that a proper part 
should be identical (not merely equal) to the whole, as is seen in the case 
of structures in the abstract sense. The structure of the series of integers, 
e.g., contains itself as a proper part and it is easily seen that there exist 
also structures containing infinitely many different parts, each containing 
the whole structure as a part. In addition there exist, even within the do¬ 
main of constructivistic logic, certain approximations to this self-reflexivity 
of impredicative properties, namely propositions which contain as parts of 
their meaning not themselves but their own formal demonstrability. 29 Now 
formal demonstrability of a proposition (in case the axioms and rules of in- 
140 ference are correct) implies this proposition and in many cases is equivalent 
to it. Furthermore, there doubtlessly exist sentences referring to a totality 
of sentences to which they themselves belong as, e.g., the sentence: “Every 
sentence (of a given language) contains at least one relation word.” 

Of course, this view concerning the impredicative properties makes it 
necessary to look for another solution of the paradoxes, according to which 
the fallacy (i.e., the underlying erroneous axiom) does not consist in the 
assumption of certain self-reflexivities of the primitive terms but in other 
assumptions about these. Such a solution may be found for the present in 
the simple theory of types and in the future perhaps in the development 
of the ideas sketched on pages 132 and 150. Of course, all this refers only 


28 The formal system corresponding to this view would have, instead of the axiom of 
reducibility, the rule of substitution for functions described, e.g., in Hilbert and Bernays 
1934, p- 90, applied to variables of any type, together with certain axioms of inten- 
sionality required by the concept of property which, however, would be weaker than 
Chwistek’s. It should be noted that this view does not necessarily imply the existence 
of concepts which cannot be expressed in the system, if combined with a solution of the 
paradoxes along the lines indicated on p. 149. 

29 Cf. my 1931, p. 173, or Carnap 1937, §35. 
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to concepts. As to notions in the constructivistic sense, there is no doubt 
that the paradoxes are due to a vicious circle. It is not surprising that 
the paradoxes should have different solutions for different interpretations 
of the terms occurring. 

As to classes in the sense of pluralities or totalities, it would seem that 
they are likewise not created but merely described by their definitions and 
that therefore the vicious circle principle in the first form does not apply. 

I even think there exist interpretations of the term “class” (namely as a 
certain kind of structures) where it does not apply in the second form 
either. 30 But for the development of all contemporary mathematics one 
may even assume that it does apply in the second form, which for classes 
as mere pluralities is, indeed, a very plausible assumption. One is then led 
to something like Zermelo’s axiom system for set theory, i.e., the sets are 
split up into “levels” in such a manner that only sets of lower levels can 
be elements of sets of higher levels (i.e., x e y is always false if x belongs 
to a higher level than y). There is no reason for classes in this sense to 
exclude mixtures of levels in one set and transfinite levels. The place of 
the axiom of reducibility is now taken by the axiom | of classes (Zermelo’s 141 
Aussonderungsaxiom) which says that for each level there exists for an 
arbitrary propositional function <f>(x) the set of those x of this level for 
which <j){x ) is true, and this seems to be implied by the concept of classes 
as pluralities. 

Russell adduces two reasons against the extensional view of classes, 
namely, the existence of (1) the null class, which cannot very well be a 
collection, and (2) the unit classes, which would have to be identical with 
their single elements. But it seems to me that these arguments could, if 
anything, at most prove that the null class and the unit classes (as distinct 
from their only element) are fictions (introduced to simplify the calculus 
like the points at infinity in geometry), not that all classes are fictions. 

But in Russell the paradoxes had produced a pronounced tendency to 
build up logic as far as possible without the assumption of the objective ex¬ 
istence of such entities as classes and concepts. This led to the formulation 
of the aforementioned “no class theory”, according to which classes and con¬ 
cepts were to be introduced as a fagon de parler. But propositions, too, (in 
particular those involving quantifications) 31 were later on largely included 
in this scheme, which is but a logical consequence of this standpoint, since 
e.g., universal propositions as objectively existing entities evidently belong 
to the same category of idealistic objects as classes and concepts and lead 
to the same kind of paradoxes, if admitted without restrictions. As regards 


30 Ideas tending in this direction are contained in Mirimanoff 191 7, 1917a, and 1920- 
Cf. in particular 1917a, p. 212. 

31 Cf. Russell 1906a. 
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classes, this program was actually carried out; i.e., the rules for translating 
sentences containing class names or the term “class” into such as do not 
contain them were stated explicitly; and the basis of the theory, i.e., the 
domain of sentences into which one has to translate, is clear, so that classes 
can be dispensed with (within the system Principia), but only if one as¬ 
sumes the existence of a concept whenever one wants to construct a class. 
When it comes to concepts and the interpretation of sentences containing 
this or some synonymous term, the state of affairs is by no means as clear. 

142 First of all, some of them | (the primitive predicates and relations such 
as “red” or “colder”) must apparently be considered as real objects; 32 the 
rest of them (in particular according to the second edition of Principia, all 
notions of a type higher than the first and therewith all logically interesting 
ones) appear as something constructed (i.e., as something not belonging to 
the “inventory” of the world); but neither the basic domain of propositions 
in terms of which finally everything is to be interpreted, nor the method of 
interpretation is as clear as in the case of classes (see below). 

This whole scheme of the no-class theory is of great interest as one of 
the few examples, carried out in detail, of the tendency to eliminate as¬ 
sumptions about the existence of objects outside the “data” and to replace 
them by constructions on the basis of these data. 33 The result has been 
in this case essentially negative; i.e., the classes and concepts introduced 
in this way do not have all the properties required for their use in math¬ 
ematics, unless one either introduces special axioms about the data (e.g., 
the axiom of reducibility), which in essence already mean the existence in 
the data of the kind of objects to be constructed, or makes the fiction that 
one can form propositions of infinite (and even non-denumerable) length, 34 
i.e., operates with truth-functions of infinitely many arguments, regardless 
of whether or not one can construct them. But what else is such an infi¬ 
nite truth-function but a special kind of an infinite extension (or structure) 
and even a more complicated one than a class, endowed in addition with a 
hypothetical meaning, which can be understood only by an infinite mind? 
All this is only a verification of the view defended above that logic and 
mathematics (just as physics) are built up on axioms with a real content 
which cannot be “explained away”. 

What one can obtain on the basis of the constructivistic attitude is the 

143 theory of orders (cf. page 134); only now (and this | is the strong point of 


32 In Appendix C of Principia a way is sketched by which these also could be con¬ 
structed by means of certain similarity relations between atomic propositions, so that 
these latter would be the only ones remaining as real objects. 

33 The “data” are to be understood in a relative sense here; i.e., in our case as logic 
without the assumption of the existence of classes and concepts. 

34 Cf. Ramsey 1926. 
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the theory) the restrictions involved do not appear as ad hoc hypotheses for 
avoiding the paradoxes, but as unavoidable consequences of the thesis that 
classes, concepts, and quantified propositions do not exist as real objects. 

It is not as if the u ni verse of things were divided into orders and then one 
were prohibited to speak of all orders; but, on the contrary, it is possible to 
speak of all existing things; only, classes and concepts are not among them; 
and if they are introduced as a fagon de parler, it turns out that this very 
extension of the symbolism gives rise to the possibility of introducing them 
in a more comprehensive way, and so on indefinitely. In order to carry 
out this scheme one must, however, presuppose arithmetic (or something 
equivalent), which only proves that not even this restricted logic can be 
built up on nothing. 

In the first edition of Principia, where it was a question of actually 
building up logic and mathematics, the constructivistic attitude was, for 
the most part, abandoned, since the axiom of reducibility for types higher 
than the first together with the axiom of infinity makes it absolutely nec¬ 
essary that there exist primitive predicates of arbitrarily high types. What 
is left of the constructive attitude is only: (1) The introduction of classes 
as a fagon de parler-, (2) the definition of ~, V, etc., as applied to propo¬ 
sitions containing quantifiers (which incidentally proved its fecundity in 
a consistency proof for arithmetic); (3) the step by step construction of 
functions of orders higher than 1, which, however, is superfluous owing 
to the axiom of reducibility; (4) the interpretation of definitions as mere 
typographical abbreviations, which makes every symbol introduced by def¬ 
inition an incomplete symbol (not one naming an object described by the 
definition). But the last item is largely an illusion, because, owing to the 
axiom of reducibility, there always exist real objects in the form of primitive 
predicates, or combinations of such, corresponding to each defined symbol. 
Finally also Russell’s theory of descriptions is something belonging to the 
constructivistic order of ideas. 

In the second edition of Principia (or, to be more exact, in the intro¬ 
duction to it) the constructivistic attitude is resumed again. The axiom of 
reducibility is dropped, and it is stated explicitly | that all primitive predi- 144 
cates belong to the lowest type and that the only purpose of variables (and 
evidently also of constants) of higher orders and types is to make it pos¬ 
sible to assert more complicated truth-functions of atomic propositions, 35 
which is only another way of saying that the higher types and orders are 
solely a fagon de parler. This statement at the same time informs us of 
what kind of propositions the basis of the theory is to consist, namely of 
truth-functions of atomic propositions. 


35 I.e., propositions of the form 5(a), R(a, b), etc., where 5, R are primitive predicates 
and a, b individuals. 
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This, however, is without difficulty only if the number of individuals 
and primitive predicates is finite. For the opposite case (which is chiefly of 
interest for the purpose of deriving mathematics) Ramsey (1926) took the 
course of considering our inability to form propositions of infinite length as 
a “mere accident”, to be neglected by the logician. This of course solves 
(or rather cuts through) the difficulties; but is to be noted that, if one 
disregards the difference between finite and infinite in this respect, there 
exists a simpler and at the same time more far reaching interpretation of 
set theory (and therewith of mathematics). Namely, in case of a finite 
number of individuals, Russell’s apergu that propositions about classes can 
be interpreted as propositions about their elements becomes literally true, 
since, e.g., “x e m” is equivalent to 

“x = ai V x = o 2 V • • • V x — a,k ” 

where the a, are the elements of m; and “there exists a class such that 
...” is equivalent to “there exist individuals xi,x%, ...,x n such that .. .”, 36 
provided n is the number of individuals in the world and provided we 
neglect for the moment the null class which would have to be taken care 
of by an additional clause. Of course, by an iteration of this procedure 
one can obtain classes of classes, etc., so that the logical system obtained 
would resemble the theory of simple types except for the circumstance that 
mixture of types would be possible. Axiomatic set theory appears, then, as 
an extrapolation of this scheme for the case of infinitely many individuals 
or an infinite iteration of the process of forming sets. 

145 | Ramsey’s viewpoint is, of course, everything but constructivistic, unless 

one means constructions of an infinite mind. Russell, in the second edition 
of Principia, took a less metaphysical course by confining himself to such 
truth-functions as can actually be constructed. In this way one is again led 
to the theory of orders, which, however, appears now in a new light, namely 
as a method of constructing more and more complicated truth-functions of 
atomic propositions. But this procedure seems to presuppose arithmetic in 
some form or other (see next paragraph). 

As to the question of how far mathematics can be built up on this basis 
(without any assumptions about the data i.e., about the primitive pred¬ 
icates and individuals except, as far as necessary, the axiom of infinity), 
it is clear that the theory of real numbers in its present form cannot be 
obtained. 37 As to the theory of integers, it is contended in the second edi¬ 
tion of Principia that it can be obtained. The difficulty to be overcome 


36 The Xi may, of course, as always, be partly or wholly identical with each other. 

37 As to the question how far it is possible to build up the theory of real numbers 
presupposing the integers, cf. Weyl 1918 or 1932. 
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is that in the definition of the integers as “those cardinals which belong to 
every class containing 0 and containing x + 1 if containing x”, the phrase 
“every class” must refer to a given order. So one obtains integers of dif¬ 
ferent orders, and complete induction can be applied to integers of order 
n only for properties of order n; whereas it frequently happens that the 
notion of integer itself occurs in the property to which induction is applied. 
This notion, however, is of order n +1 for the integers of order n. Now, in 
Appendix B of the second edition of Principia, a proof is offered that the 
integers of any order higher than 5 are the same as those of order 5, which 
of course would settle all difficulties. The proof as it stands, however, is 
certainly not conclusive. In the proof of the main lemma *89.16, which 
says that every subset a (of arbitrarily high order) 38 of an inductive class 
(3 of order 3 is itself an inductive class of order 3, induction is applied to 
a property of fi involving a (namely a — 0 ^ A, which, however, | should 
read a - f3 ~ e Induct 2 , because (3) is evidently false). This property, 
however, is of an order > 3 if a is of an order > 3. So the question whether 
(or to what extent) the theory of integers can be obtained on the basis of 
the ramified hierarchy must be considered as unsolved at the present time. 
It is to be noted, however, that, even in case this question should have a 
positive answer, this would be of no value for the problem whether arith¬ 
metic follows from logic, if propositional functions of order n are defined 
(as in the second edition of Principia) to be certain finite (though arbitrar¬ 
ily complex) combinations (of quantifiers, propositional connectives, etc.), 
because then the notion of finiteness has to be presupposed, which fact is 
concealed only be taking such complicated notions as “propositional func¬ 
tion of order n” in an unanalyzed form as primitive terms of the formalism 
and giving their definition only in ordinary language. The reply may per¬ 
haps be offered that in Principia the notion of a propositional function of 
order n is neither taken as primitive nor defined in terms of the notion 
of a finite combination, but rather quantifiers referring to propositional 
functions of order n (which is all one needs) are defined as certain infinite 
conjunctions and disjunctions. But then one must ask: Why doesn’t one 
define the integers by the infinite disjunction: 

® = 0Vz = O + 1Vi = 0+1 + 1V" - ad infinitum, 

saving in this way all the trouble connected with the notion of inductive¬ 
ness? This whole objection would not apply if one understands by a propo¬ 
sitional function of order n one “obtainable from such truth-functions of 


38 That the variable a is intended to be of undetermined order is seen from the later 
applications of *89.17 and from the note to *89.17. The main application is in line (2) of 
the proof of *89.24, where the lemma under consideration is needed for a’s of arbitrarily 
high orders. 
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atomic propositions as presuppose for their definition no totalities except 
those of the propositional functions of order < n and of individuals”; this 
notion, however, is somewhat lacking in precision. 

The theory of orders proves more fruitful if considered from a purely 
mathematical standpoint, independently of the philosophical question 
whether impredicative definitions are admissible. Viewed in this manner, 
i.e., as a theory built up within the framework of ordinary mathemat¬ 
ics, where impredicative definitions are admitted, there is no objection to 
extending it to arbitrarily high transfinite orders. Even if one rejects im- 

147 predicative definitions, there would, I think, be no objection to | extend it 
to such transfinite ordinals as can be constructed within the framework of 
finite orders. The theory in itself seems to demand such an extension since 
it leads automatically to the consideration of functions in whose definition 
one refers to all functions of finite orders, and these would be functions 
of order u. Admitting transfinite orders, an axiom of reducibility can be 
proved. This, however, offers no help to the original purpose of the theory, 
because the ordinal a —such that every propositional function is extension- 
ally equivalent to a function of order a —is so great, that it presupposes 
impredicative totalities. Nevertheless, so much can be accomplished in this 
way, that all impredicativities are reduced to one special kind, namely the 
existence of certain large ordinal numbers (or well-ordered sets) and the 
validity of recursive reasoning for them. In particular, the existence of a 
well-ordered set, of order type oq already suffices for the theory of real num¬ 
bers. In addition this transfinite theorem of reducibility permits the proof 
of the consistency of the axiom of choice, of Cantor’s continuum hypothesis 
and even of the generalized continuum hypothesis (which says that there 
exists no cardinal number between the power of any arbitrary set and the 
power of the set of its subsets) with the axioms of set theory as well as of 
Principia. 

I now come in somewhat more detail to the theory of simple types which 
appears in Principia as combined with the theory of orders; the former is, 
however, (as remarked above) quite independent of the latter, since mixed 
types evidently do not contradict the vicious circle principle in any way. 
Accordingly, Russell also based the theory of simple types on entirely dif¬ 
ferent reasons. The reason adduced (in addition to its “consonance with 
common sense”) is very similar to Frege’s, who, in his system, already had 
assumed the theory of simple types for functions, but failed to avoid the 
paradoxes, because he operated with classes, (or rather functions in exten¬ 
sion) without any restriction. This reason is that (owing to the variable 
it contains) a propositional function is something ambiguous (or, as Frege 
says, something unsaturated, wanting supplementation) and therefore can 
occur in a meaningful proposition only in such a way that this ambiguity 

148 is eliminated (e.g., by substituting a | constant for the variable or applying 
quantification to it). The consequences are that a function cannot replace 
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an individual in a proposition, because the latter has no ambiguity to be re¬ 
moved, and that functions with different kinds of arguments (i.e., different 
ambiguities) cannot replace each other; which is the essence of the theory 
of simple types. Taking a more nominalistic viewpoint (such as suggested 
in the second edition of Principia and in Meaning and truth) one would 
have to replace “proposition” by “sentence” in the foregoing considerations 
(with corresponding additional changes). But, in both cases, this argument 
clearly belongs to the order of ideas of the “no class” theory, since it con¬ 
siders the notions (or propositional functions) as something constructed 
out of propositions or sentences by leaving one or several constituents of 
them undetermined. Propositional functions in this sense are so to speak 
“fragments” of propositions, which have no meaning in themselves, but 
only in so far as one can use them for forming propositions by combining 
several of them, which is possible only if they “fit together”, i.e., if they 
are of appropriate types. But, it should be noted that the theory of simple 
types (in contradistinction to the vicious circle principle) cannot in a strict 
sense follow from the constructive standpoint, because one might construct 
notions and classes in another way, e.g., as indicated on page 144, where 
mixtures of types are possible. If on the other hand one considers con¬ 
cepts as real objects, the theory of simple types is not very plausible since 
what one would expect to be a concept (such as, e.g., “transitivity” or the 
number two) would seem to be something behind all its various “realiza¬ 
tions” on the different levels and therefore does not exist according to the 
theory of types. Nevertheless, there seems to be some truth behind this 
idea of realizations of the same concept on various levels, and one might, 
therefore, expect the theory of simple types to prove useful or necessary 
at least as a stepping stone for a more satisfactory system, a way in which 
it has already been used by Quine. 39 Also Russell’s “typical ambiguity” is 
149 a step in this direction. Since, however, it only adds certain simpli|fying 
symbolic conventions to the theory of types, it does not de facto go beyond 
this theory. 

It should be noted that the theory of types brings in a new idea for 
the solution of the paradoxes, especially suited to their intensional form. 
It consists in blaming the paradoxes not on the axiom that every proposi¬ 
tional function defines a concept or class, but on the assumption that every 
concept gives a meaningful proposition, if asserted for any arbitrary object 
or objects as arguments. The obvious objection that every concept can 
be extended to all arguments, by defining another one which gives a false 
proposition whenever the original one was meaningless, can easily be dealt 
with by pointing out that the concept “meaningfully applicable” need not 
itself be always meaningfully applicable. 


’Quine 1937. 
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The theory of simple types (in its realistic interpretation) can be consid¬ 
ered as a carrying through of this scheme, based, however, on the following 
additional assumption concerning meaningfulness: “Whenever an object x 
can replace another object y in one meaningful proposition, it can do so 
in every meaningful proposition.” 40 This of course has the consequence 
that the objects are divided into mutually exclusive ranges of significance, 
each range consisting of those objects which can replace each other; and 
that therefore each concept is significant only for arguments belonging to 
one of those ranges, i.e., for an infinitely small portion of all objects. What 
makes the above principle particularly suspect, however, is that its very as¬ 
sumption makes its formulation as a meaningful proposition impossible, 41 
because x and y must then be confined to definite ranges of significance 
which are either the same or different, and in both cases the statement 
does not express the principle or even part of it. Another consequence is 
that the fact that an object x is (or is not) of a given type also cannot be 
expressed by a meaningful proposition. 

150 | It is not impossible that the idea of limited ranges of significance could 

be carried out without the above restrictive principle. It might even turn 
out that it is possible to assume every concept to be significant everywhere 
except for certain “singular points” or “limiting points”, so that the para¬ 
doxes would appeal’ as something analogous to dividing by zero. Such a 
system would be most satisfactory in the following respect: our logical 
intuitions would then remain correct up to certain minor corrections, i.e., 
they could then be considered to give an essentially correct, only somewhat 
“blurred”, picture of the real state of affairs. Unfortunately the attempts 
made in this direction have failed so far; 42 on the other hand, the impos¬ 
sibility of this scheme has not been proved either, in spite of the strong 
inconsistency theorems of Kleene and Rosser. 43 

In conclusion I want to say a few words about the question whether 
(and in which sense) the axioms of Principia can be considered to be an¬ 
alytic. As to this problem, it is to be remarked that analyticity may be 
understood in two senses. First, it may have the purely formal sense that 


40 Russell formulates a somewhat different principle with the same effect, in Principia, 
vol. 1, p. 95. 

41 This objection does not apply to the symbolic interpretation of the theory of types, 
spoken of on p. 148, because there one does not have objects but only symbols of different 
types. 

42 A formal system along these lines is Church’s (cf. his 1932 and 1933), where, 
however, the underlying idea is expressed by the somewhat misleading statement that 
the law of excluded middle is abandoned. However, this system has been proved to be 
inconsistent. See footnote 43. 

43 Cf. Kleene and Rosser 1935. 
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the terms occurring can be defined (either explicitly or by rules for elimi¬ 
nating them from sentences containing them) in such a way that the axioms 
and theorems become special cases of the law of identity and disprovable 
propositions become negations of this law. In this sense even the theory 
of integers is demonstrably non-analytic, provided that one requires of the 
rules of elimination that they allow one actually to carry out the elimina¬ 
tion in a finite number of steps in each case. 44 Leaving out this condition 
by admitting, e.g., sentences of infinite (and non-denumerable) length as 
intermediate steps of the process of reduction, all axioms of Principia | (in- 151 
eluding the axioms of choice, infinity and reducibility) could be proved to 
be analytic for certain interpretations (by considerations similar to those 
referred to on page 144). 45 But this observation is of doubtful value, be¬ 
cause the whole of mathematics as applied to sentences of infinite length 
has to be presupposed in order to prove this analyticity, e.g., the axiom of 
choice can be proved to be analytic only if it is assumed to be true. 

In a second sense a proposition is called analytic if it holds “owing to the 
meaning of the concepts occurring in it”, where this meaning may perhaps 
be undefinable (i.e., irreducible to anything more fundamental). 46 It would 
seem that all axioms of Principia , in the first edition, (except the axiom of 
infinity) are in this sense analytic for certain interpretations of the prim¬ 
itive terms, namely if the term “predicative function” is replaced either 
by “class” (in the extensional sense) or (leaving out the axiom of choice) 
by “concept”, since nothing can express better the meaning of the term 
“class” than the axiom of classes (cf. page 140) and the axiom of choice, 
and since, on the other hand, the meaning of the term “concept” seems 
to imply that every propositional function defines a concept. 47 The diffi¬ 
culty is only that we don’t perceive the concepts of “concept” and of “class” 


44 Because this would imply the existence of a decision procedure for all arithmetical 
propositions. Cf. Turing 1931. 

45 Cf. also Ramsey 1926, where, however, the axiom of infinity cannot be obtained, 
because it is interpreted to refer to the individuals in the world. 

46 The two significations of the term analytic might perhaps be distinguished as tau¬ 
tological and analytic. 

47 This view does not contradict the opinion defended above that mathematics is 
based on axioms with a real content, because the very existence of the concept of e.g., 
“class” constitutes already such an axiom; since, if one defined e.g., “class” and “e” to 
be “the concepts satisfying the axioms”, one would be unable to prove their existence. 
“Concept” could perhaps be defined in terms of “proposition” (cf. p. 148), although 
I don’t think that this would be a natural procedure; but then certain axioms about 
propositions, justifiable only with reference to the undefined meaning of this term, will 
have to be assumed. It is to be noted that this view about analyticity makes it again 
possible that every mathematical proposition could perhaps be reduced to a special case 
of a = a, namely if the reduction is effected not in virtue of the definitions of the terms 
occurring, but in virtue of their meaning, which can never be completely expressed in a 
set of formal rules. 
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with sufficient distinctness, as is shown by the paradoxes. In view of this 

152 situation, Russell took the course of considering | both classes and concepts 
(except the logically uninteresting primitive predicates) as nonexistent and 
of replacing them by constructions of our own. It cannot be denied that this 
procedure has led to interesting ideas and to results valuable also for one 
taking the opposite viewpoint. On the whole, however, the outcome has 
been that only fragments of mathematical logic remain, unless the things 
condemned are reintroduced in the form of infinite propositions or by such 
axioms as the axiom of reducibility which (in case of infinitely many in¬ 
dividuals) is demonstrably false unless one assumes either the existence of 
classes or of infinitely many “qualitates occultae ". This seems to be an 
indication that one should take a more conservative course, such as would 
consist in trying to make the meaning of the terms “class” and “concept” 
clearer, and to set up a consistent theory of classes and concepts as objec¬ 
tively existing entities. This is the course which the actual development 
of mathematical logic has been taking and which Russell himself has been 
forced to enter upon in the more constructive parts of his work. Major 
among the attempts in this direction (some of which have been quoted in 
this essay) are the simple theory of types (which is the system of the first 
edition of Principia in an appropriate interpretation) and axiomatic set 
theory, both of which have been successful at least to this extent, that they 
permit the derivation of modern mathematics and at the same time avoid 
all known paradoxes. Many symptoms show only too clearly, however, that 
the primitive concepts need further elucidation. 

It seems reasonable to suspect that it is this incomplete understanding 
of the foundations which is responsible for the fact that mathematical logic 
has up to now remained so far behind the high expectations of Peano 
and others who (in accordance with Leibniz’s claims) had hoped that it 
would facilitate theoretical mathematics to the same extent as the decimal 
system of numbers has facilitated numerical computations. For how can 
one expect to solve mathematical problems systematically by mere analysis 
of the concepts occurring if our analysis so far does not even suffice to set 
up the axioms? But there is no need to give up hope. Leibniz did not 
in his writings about the Characteristica universalis speak of a utopian 

153 project; if we are to | believe his words he had developed this calculus of 
reasoning to a large extent, but was waiting with its publication till the 
seed could fall on fertile ground. 48 He went even so far 49 as to estimate the 
time which would be necessary for his calculus to be developed by a few 
select scientists to such an extent “that humanity would have a new kind 
of an instrument increasing the powers of reason far more than any optical 


48 Leibniz 1890, p. 12. Cf. also Vacca 1903, p. 72, and the preface to Leibniz 1923. 

49 Leibniz 1890, p. 187. 
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instrument has ever aided the power of vision.” The time he names is five 
years, and he claims that his method is not any more difficult to learn than 
the mathematics or philosophy of his time. Furthermore, he said repeatedly 
that, even in the rudimentary state to which he had developed the theory 
himself, it was responsible for all his mathematical discoveries; which, one 
should expect, even Poincare would acknowledge as a sufficient proof of its 
fecundity. 50 


50 I wish to express my thanks to Professor Alonzo Church of Princeton University, 
lo helped me to find the correct English expressions in a number of places. 
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Introductory note to 1946 


In 1946, the year of its bicentennial, Princeton University organized 
several conferences. The conference on problems of mathematics took 
place on 17-19 December 1946. Its more than seventy participants in¬ 
cluded most of the leading American mathematicians and several from 
abroad. Its sessions included discussions of practically the whole range 
of pure mathematics. The university printed a pamphlet containing a 
summary of the discussion in each of the sessions ( Princeton University 
1947). Alfred Tarski was listed as “discussion leader” in the session on 
mathematical logic, held on 17 December, and evidently gave the prin¬ 
cipal invited talk. Godel’s remarks took their point of departure from 
Tarski’s talk, and may have been invited or planned as a comment. 

No proceedings of the conference were published, but a larger volume 
entitled Problems of mathematics was planned at the time. Godel sent 
his paper to J. C. C. McKinsey early in 1947 for this purpose. Corre¬ 
spondence with J. W. Tukey indicates that a chapter was planned on the 
Hilbert problems, for which Godel was asked to write on the first two 
(the continuum problem and the problem of the consistency of analy¬ 
sis). Godel agreed to do the first but declined to do the second, on 
the ground that it ought to be done by someone more sympathetic to 
Hilbert’s views. 

Although it could not have been unknown, the paper Godel 1946 
seems to have had little circulation before its publication in Davis 1965. a 
Its main new technical idea, ordinal definability, was rediscovered inde¬ 
pendently by several others (see below). The reprint Godel 1968 con¬ 
tained several changes in the text. Correspondence between Godel and 
Mario Casolini, evidently an editor with the publisher of 1968, shows 
that the principal changes were proposed by Godel. 

Godel’s paper consists largely of an exploration of possible absolute 
notions of demonstrability and definability, which would not have to be 
relative to a particular formal system or formalized language. From the 
summary of the discussion ( Princeton University 1947, pages 10-12), ev¬ 
idently much of the session was concerned with decision problems; Godel 
thus began by noting that, with the concept of general recursiveness or 
Turing computability, “one has for the first time succeeded in giving an 
absolute definition of an interesting epistemological notion, i.e., one not 


a Davis has informed me that he did not know of the existence of this paper until 
S. C. Kleene suggested its inclusion in The undecidable and supplied a copy. Kleene 
has indicated that the paper was published there with Godel’s approval. 
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depending on the formalism chosen” (Godel 1946, page 1). Godel refers 
here not primarily to the equivalence of different formulations such as 
Turing computability, A-definability and Herbrand Godel general recur¬ 
siveness, but to the absence of the sort of relativity to a given language 
that leads to stratification of the notion, such as (in the case of defin¬ 
ability in a formalized language) into definability in languages of greater 
and greater expressive power. Such stratification is driven by diagonal 
arguments. But, since a function enumerating the recursive functions is 
not recursive and there is no reason to think it computable, the diagonal 
function it gives rise to is simply non-recursive, rather than “recursive 
at the next level”. One can of course effectively enumerate computing 
procedures (partial recursive functions), but then the diagonal proce¬ 
dure simply leads to partial recursive functions that must be undefined 
for certain arguments (and to the undecidability of the question whether 
an arbitrary partial recursive function is defined for a given argument). 

Godel is thus encouraged to search for absolute notions of demon- 
strability and definability. His remarks on the former notion (pages 
1-2) are brief. Reflection on a formalism that makes the notion of prov¬ 
ability precise “gives rise to new axioms which are exactly as evident and 
justified as those with which you started”. (That the new axioms are 
exactly as evident might be questioned, even in the case of the weakest, 
such as the statement of the consistency of the formalism.) The process 
of transfinite iteration of such extension, which he then mentions, had 
already been studied in Turing 1939-, completeness results of the sort en¬ 
visaged in the remark that all steps of such an extension process “could 
be described and collected together in some non-constructive way” were 
proved in Feferman 1962, but such results are essentially arithmetic in 
character. In set theory, Godel suggests that absolute provability would 
have to incorporate a notion of proof using stronger and stronger axioms 
of infinity. The notion of an “axiom of infinity” could not be given a 
“combinational and decidable” characterization, but an axiom of infinity 
might be characterized as a sentence of a certain formal structure that is 
true-, then by absolute demonstrability one might mean proof with the 
help of axioms of infinity in that sense. But evidently Godel thought this 
suggestion very speculative. It was perhaps bound up with the hope, so 
far disappointed, that suitable axioms of infinity would suffice to decide 
the continuum problem. 

Gddel now turns to mathematical definability, where he says he “can 
give somewhat more definite suggestions”. A hierarchy of concepts of 
definability is forced on us by a “finitistic concept of language”, but to 
collect into one notion all the stages of a hierarchy of this kind requires 
“as many primitive terms as you wish to consider steps in this hierar¬ 
chy of languages, i.e., as many as there are ordinal numbers” (page 2). 
It is noteworthy that he considers immediately a hierarchy indexed 
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by arbitrary ordinals in the set-theoretic sense. He is led immediately to 
the concept of ordinal definability (pages 2-3): 

The simplest way of doing it is to take the ordinals themselves 
as primitive terms. So one is led to the concept of definability in 
terms of ordinals, i.e., definability by expressions containing names 
of ordinal numbers and logical constants, including quantification 
referring to sets. 

Godel observes that one obtains no new ordinal-definable sets by extend¬ 
ing the language by a truth predicate, presumably for formulas contain¬ 
ing names of arbitrary ordinals. This implies immediately that “x is an 
ordinal-definable set” is expressible in the language of ZF, an assertion 
that has usually been proved by means of Levy’s reflection principle. 13 An 
extension of that argument proves Godel’s assertion about the ordinal- 
definable sets ( OD). c It is reasonable to conjecture that Gddel knew the 
reflection principle at the time. Otherwise he would have had to have a 
quite different argument in mind; moreover, an argument like the proof 
of the principle figures in the proof in Gddel 1940 that V = L (where 
V is the universe of all sets and L is the class of all constructible sets) 
implies the generalized continuum hypothesis. 

Gddel remarks that his notion of constructible set is also a kind of 
definability in terms of ordinals (see Gddel 1939a). Since it admits quan¬ 
tification only over constructible sets and not over sets in general, how¬ 
ever, “you can actually define sets, and even sets of integers, for which 
you cannot prove that they are constructible” {1946, page 3). For this 
reason he considers constructibility not satisfactory as a notion of defin¬ 
ability. Godel’s language is puzzling, since he seems to be saying that, 
for some definition of a set of integers, it. is unprovable that that set is 


b My hit l and Scott 1971, p. 272; Krivine 1968 or 1971, Chapter 6. 
c Let Sat(x,y) mean: a: is a formula of the language of set theory, y is an as¬ 
signment of objects to the free variables of x, and y satisfies x. Now let A(x) be a 
formula of the language of set theory augmented by Sat, with ordinal parameters, 
that is uniquely true of the set xq. Then, by the extended reflection principle, we 
can find an ordinal f3 such that xq e V/}, the ordinal parameters are less than [j and 
(Vp, f) is an elementary substructure of the universe. In other words, for all formulas 
u of the language of set theory and all assignments y of objects in Vp, 

(1) Sat(u,y) <-> Sato(u,Vp,y), 

where Sato(x,z,y) is the satisfaction predicate for formulas with quantifiers inter¬ 
preted to range over z, and moreover Vp reflects the formula A(x), that is, for any 
xeVp, 

(2) A(x)~A v l>(x). 

But now (1) implies that in A V B (x) we may replace any subformula Sat(u,v ) by 
Sato(u, Vp, v), which is a formula of ZF. It follows by (2) that x„ is definable in ZF 
from the given ordinal parameters and /?, that is, 20 e OD. 
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constructible. How would he have known this then? The first published 
proof of the consistency of the existence of non-constructible sets of in¬ 
tegers is in Cohen 1963. d 

The remainder of the paper deals with the claim that the notion 
of ordinal definability is a satisfactory absolute notion and with the 
mathematical interest of this notion. Reversing Godel’s order, we note 
that he conjectures that the ordinal-definable sets satisfy the axioms of 
set theory “and so will lead to another and probably simpler proof of the 
consistency of the axiom of choice” (page 4). He does not claim to be in 
possession of such a proof, which has since been given by others by means 
of the observation that the hereditarily ordinal-definable sets (HOD) are 
an inner model satisfying the axiom of choice. Since the proof of this is 
rather straightforward (see Myhill and Scott 1971), Godel’s caution may 
mean simply that he had not worked it out in detail. Godel’s second and 
closing remark is that, although it can be proved (in ZF) that there is a 
bound on the ordinals needed to obtain ordinal-definable sets of integers, 
he doubts that one can prove that the bound is wi, as in the case of the 
constructible sets. Therefore the proof of the consistency of the axiom 
of choice will not extend to the continuum hypothesis as did that by the 
constructible sets. It was subsequently proved in McAloon 1966 that 
V = HOD is consistent with the negation of the continuum hypothesis, 
thus verifying this conjecture of Godel’s® Godel may have thought this 
a virtue of the notion, since V = HOD might then be compatible with 
the sort of axiom Godel speculates about in 1947 that would refute the 
continuum hypothesis. f He offers no direct comment on the question of 
the truth ofV = OD , g 

Godel’s philosophical remarks are prompted by the obvious objection 
that admitting all ordinals as primitive terms makes his notion no longer 


d It was later shown by Solovay (1967) that strong axioms of infinity, such as the 
existence of a measurable or Ramsey cardinal, imply the existence of even analytically 
definable sets (of integers) that are not constructible. 

e Godel wished to add to 1968 a note stating this result, but it was received too 
late by the publisher. The note is included in the present volume. 

f In default of the axioms, there cannot be definite results on this question. How¬ 
ever, V = OD has been shown consistent with strong axioms of infinity incompatible 
with V = L. Let M be the proposition that there exists a measurable cardinal. 
McAloon 1966 shows that, if ZF + M is consistent, then ZF + M + V = OD is 
consistent. (This follows from Corollary 6.5 of Kunen 1970.) It is extended from 
measurable to supercompact cardinals in Menas 1973. 

Yet it seems doubtful that Godel’s primary interest in the notion was as a means 
of proving the consistency of the axiom of choice in settings where V = L fails. Wang 
reports that in 1941 Godel had another general method of proving the consistency 
of the axiom of choice- ( Wang 1981, p. 657). 

6 Note that, since V a e OD and HOD is transitive, if HOD = OD then V = 
HOD = OD. Thus V = OD, HOD = OD, and V = HOD are all equivalent. 
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a notion of definability. He finds it plausible that “all things conceivable 
by us are denumerable”. Does this mean that for any x, if x is conceiv¬ 
able by us, then x is denumerable, or that there are only denumerably 
many things conceivable by us? The latter reading seems more likely. 
Godel remarks that, because of the paradox of the least indefinable or¬ 
dinal, a notion of mathematical definability that makes the notion itself 
mathematically definable will have to have all ordinals definable. But 
what follows is that a notion satisfying the “postulate of denumerability” 
must involve some “extramathematical element concerning the psychol¬ 
ogy of the being which deals with mathematics” (page 4). It seems to 
me that the point is not just that to characterize such a notion will re¬ 
quire some extramathematical vocabulary, for what will rule out a least 
ordinal not definable with the help of the extra vocabulary? Apparently 
the extra vocabulary must have the property that “definable with the 
help of the extra vocabulary” is no longer definable with the help of the 
extra vocabulary. Godel finally argues that ordinal definability at least 
captures the notion of “being formed according to a law” as opposed to 
“being formed by a random choice of the elements” (page 4). In partic¬ 
ular, there is not a random element in the ordinals themselves. 

I would note further that, since the notion of definability contains a 
modal element, the question whether ordinal definability is a genuine no¬ 
tion of definability depends on the underlying modal notion, the “can” in 
“can be defined”. Admitting all ordinals as definable might be viewed as 
an extreme extension of the notion of abstract mathematical possibility 
that arises in other contexts in the foundations of mathematics, such as 
that of computable function, where complete abstraction is made both 
from the limitations of “hardware” and from feasibility in terms of the 
time required for a computation. It should be kept in mind that an or¬ 
dinal definition of a set requires only finitely many ordinals (which can 
be reduced to one). To deny that an ordinal-definable set is “really” de¬ 
finable implies the existence of ordinals that are not really definable, no 
matter how our means of such definition might be extended. It is hard 
to see how a case could be made out for this so long as one stays on the 
abstract mathematical plane and does not introduce notions concerning 
“the psychology of the being which deals with mathematics”, at least in 
a broad sense. 11 

It appears that subsequent work on ordinal definability was done al¬ 
most entirely independently of Godel’s. In 1952, Post rediscovered the 


h In unpublished work, Allen Hazen argues that a physicalist theory of the mind 
implies that there is an absolute notion of definability satisfying the condition of 
denumerability, or at least some other cardinality restriction that makes the paradox 
of the least indefinable ordinal a genuine problem. 
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notion and proved some of its principal properties (see Myhill and Scott 
1971, page 278), but his abstract 1953 was evidently too cryptic to be 
understood, and Post died soon afterward. The idea was also implicit in 
Takeuti 1961. The concept became more widely known through the work 
of Myhill and Scott in the early 1960s. They showed the definability of 
u x is an ordinal-definable set”, carried out the relative consistency proof 
for the axiom of choice, and also proved the equivalence of ordinal defin¬ 
ability and constructibility if both are phrased in terms of definability in 
second-order logic. Further work was stimulated by Cohen’s discovery 
of the method of forcing. Since obviously L C HOD C OD C V, and 
HOD — OD implies V = OD (see footnote g), the question naturally 
arises as to what can consistently be assumed about the equality and 
inequality of L, HOD , and V. Let us define A C B to be A C B and 
A ^ B. Levy (1965) proved that Cohen’s model showing the indepen¬ 
dence of V = L from ZFC + GCH satisfies L — HOD, thus establishing 
the consistency of L = HOD c V. McAloon (1966) showed the consis¬ 
tency of L C HOD — V, both with GCH and with CH false (see his 
1971 ). In 1966 he also established the consistency of L C HOD C V. 

Classes related to HOD have been used in model constructions for 
other purposes. A noteworthy example occurs in Solovay 1970. The 
main result of his paper is the consistency relative to ZF + DC (the 
axiom of dependent choices) + I (‘there exists a strongly inaccessible 
cardinal’) of a list of propositions including ‘every set of reals is Lebesgue 
measurable’. One constructs a forcing extension AT of a ground model 
M of ZFC + I. With the help of the fact that, in N, a strongly inac¬ 
cessible cardinal of M is collapsed to Hi, it is shown that, although the 
axiom of choice still holds in N, all sets of reals definable from a count¬ 
able sequence of ordinals are Lebesgue measurable (Theorem 2). Within 
N, one then constructs the submodel Ni of sets hereditarily definable 
from countable sequences of ordinals; in Ni all sets of reals are Lebesgue 
measurable (1970, page 52). 

The exploration in 1946 of the notions of absolute demonstrability 
and absolute definability, and the development in the latter context of 
the notion of ordinal definability, are instances of the application to con¬ 
crete problems of Godel’s realistic point of view. 1 But it should be noted 
that Godel qualifies the “absoluteness” of the notions he considers: they 
are “not absolute in the strictest sense, but only with respect to a certain 
system of things, namely the sets as described in axiomatic set theory” 
(1946, page 4). The question whether such notions can be treated in a 
“completely absolute way” is left open. 


‘See my introductory 


Godel 1944 in this volume. 
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The pagination in the text follows the original typescript, whose pag¬ 
ination differs somewhat from that of Davis 1965. 


Charles Parsons- 5 

5 1 am indebted to John P. Burgess, Martin Davis, John W. Dawson, Jr., Solomon 
Feferman, Kenneth McAloon, Dana Scott and Hao Wang for information, assistance 
and suggestions. 


Remarks before the 
Princeton bicentennial conference 
on problems in mathematics 

(me) 


Tarski has stressed in his lecture (and I think justly) the great impor¬ 
tance of the concept of general recursiveness (or Turing’s computability). 
It seems to me that this importance is largely due to the fact that with this 
concept one has for the first time succeeded in giving an absolute defini¬ 
tion of an interesting epistemological notion, i.e., one not depending on the 
formalism chosen. 1 In all other cases treated previously, such as demon- 
strability or definability, one has been able to define them only relative to 
a given language, and for each individual language it is clear that the one 
thus obtained is not the one looked for. For the concept of computability, 
however, although it is merely a special kind of demonstrability or decid¬ 
ability, the situation is different. By a kind of miracle it is not necessary 
to distinguish orders, and the diagonal procedure does not lead outside the 
defined notion. This, I think, should encourage one to expect the same 
thing to be possible also in other cases (such as demonstrability or defin¬ 
ability). It is true that for these other cases there exist certain negative 
results, such as the incompleteness of every formalism or the paradox of 
Richard. But closer examination shows that these results do not make a 


1 [Footnote added in 1965: To be more precise: a function of integers is computable 
in any formal system containing arithmetic if and only if it is computable in arith¬ 
metic, where a function / is called computable in 5 if there is in S a computable term 
representing /.] 
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definition of the absolute notions concerned impossible under all circum¬ 
stances, but only exclude certain ways of defining them, or, at least, that 
certain very closely related concepts may be definable in an absolute sense. 

Let us consider, e.g., the concept of demonstrability. It is well known 
that, in whichever way you make it precise by means of a formalism, the 
contemplation of this very formalism gives rise to new axioms which are ex¬ 
actly as evident and justified as those with which you started, and that this 
process of extension can be iterated into the transfinite. So there cannot 
exist any formalism which would embrace all these steps; but this does not 
exclude that all these steps (or at least all of them which give something 
new for the domain | of propositions in which you are interested) could 2 
be described and collected together in some non-constructive way. In set 
theory, e.g., the successive extensions can most conveniently be represented 
by stronger and stronger axioms of infinity. It is certainly impossible to 
give a combinational and decidable characterization of what an axiom of 
infinity is; but there might exist, e.g., a characterization of the following 
sort: An axiom of infinity is a proposition which has a certain (decidable) 
formal structure and which in addition is true. Such a concept of demon¬ 
strability might have the required closure property, i.e., the following could 
be true: Any proof for a set-theoretic theorem in the next higher system 
above set theory (i.e., any proof involving the concept of truth which I just 
used) is replaceable by a proof from such an axiom of infinity. It is not 
impossible that for such a concept of demonstrability some completeness 
theorem would hold which would say that every proposition expressible in 
set theory is decidable from the present axioms plus some true assertion 
about the largeness of the universe of all sets. 

Let me consider a second example where I can give somewhat more 
definite suggestions, namely the concept of definability (or, to be more 
exact, of mathematical definability). Here also you have, corresponding 
to the transfinite hierarchy of formal systems, a transfinite hierarchy of 
concepts of definability. Again it is not possible to collect together all 
these languages in one, as long as you have a finitistic concept of language, 
i.e., as long as you require that a language must have a finite number of 
primitive terms. But, if you drop this condition, it does become possible 
(at least as far as it is necessary for the purpose), namely, by means of a 
language which has as many primitive terms as you wish to consider steps 
in this hierarchy of languages, i.e., as many as there are ordinal numbers. 
The simplest way of doing it is to take the ordinals themselves as primitive 
terms. So one is led to the concept of definability in terms of ordinals, | i.e., 3 

definability by expressions containing names of ordinal numbers and logical 
constants, including quantification referring to sets. This concept should, 

I think, be investigated. It can be proved that it has the required closure 
property: By introducing the notion of truth for this whole transfinite 
language, i.e., by going over to the next language, you will obtain no new 
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definable sets (although you will obtain new definable properties of sets). 

The concept of constructible set I used in the consistency proof for the 
continuum hypothesis can be obtained in a very similar way, i.e., as a kind 
of definability in terms of ordinal numbers; but, comparing constructibility 
with the concept of definability just outlined, you will find that not all 
logical means of definition are admitted in the definition of constructible 
sets. Namely, quantification is admitted only with respect to constructible 
sets and not with respect to sets in general. This has the consequence 
that you can actually define sets, and even sets of integers, for which you 
cannot prove that they are constructible (although this can of course be 
consistently assumed). For this reason, I think constructibility cannot be 
considered as a satisfactory formulation of definability. 

But now, coming back to the definition of definability I suggested, it 
might be objected that the introduction of all ordinals as primitive terms 
is too cheap a way out of the difficulty, and that the concept thus ob¬ 
tained completely fails to agree with the intuitive concept we set out to 
make precise, because there exist undenumerably many sets definable in 
this sense. There is certainly some justification in this objection. For it 
has some plausibility that all things conceivable by us are denumerable, 
even if you disregard the question of expressibility in some language. But, 
on the other hand, there is much to be said in favor of the concept under 
consideration; namely, above all it is clear that, if the concept of mathemat¬ 
ical definability is to be itself mathematically definable, it must necessarily 
be so that all ordinal numbers are definable, because otherwise you could 
4 define the first ordinal number not definable, and | would thus obtain a 
contradiction. I think this does not mean that a concept of definability 
satisfying the postulate of denumerability is impossible, but only that it 
would involve some extramathematical element concerning the psychology 
of the being who deals with mathematics. 

But, irrespective of what the answer to this question may be, I would 
think that “definability in terms of ordinals”, even if it is not an ade¬ 
quate formulation for “comprehensibility by our mind”, is at least an ad¬ 
equate formulation in an absolute sense for a closely related property of 
sets, namely, the property of “being formed according to a law” as opposed 
to “being formed by a random choice of the elements”. For, in the ordi¬ 
nals there is certainly no element of randomness, and hence neither in sets 
defined in terms of them. This is particularly clear if you consider von Neu¬ 
mann’s definition of ordinals, because it is not based on any well-ordering 
relations of sets, which may very well involve some random element. 

You may have noticed that, in both examples I gave, the concepts ar¬ 
rived at or envisaged were not absolute in the strictest sense, but only 
with respect to a certain system of things, namely the sets as conceived in 
axiomatic set theory; i.e., although there exist proofs and definitions not 
falling under these concepts, these definitions and proofs give, or are to 
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give, nothing new within the domain of sets and of propositions expressible 
in terms of “set”, “e” and the logical constants. The question whether the 
two epistemological concepts considered, or any others, can be treated in 
a completely absolute way is of an entirely different nature. 

In conclusion I would like to say that, irrespective of whether the concept 
of definability suggested in this lecture corresponds to certain intuitive 
notions, it has some intrinsic mathematical interest; in particular, there are 
two questions arising in connection with it: (1) Whether the sets definable 
in this sense satisfy the axioms of set theory. I think this question is to 
be answered in the affirmative, and so will lead to another, and probably 
simpler, proof for the consistency of the axiom of choice. (2) It follows from 
the axiom of replacement that the ordinals necessary to define all sets of 
integers which can at all be defined in this way will have an upper limit. I 
doubt that it will be possible to prove that this upper limit is uq, as in the 
case of the constructible sets. 2 


2 [Footnote added on 26 June 1968: I have recently been informed that this conjecture 
has been verified by Kenneth McAloon in a dissertation at the University of California 
at Berkeley: to be more precise, that Dr. McAloon, using Cohen’s method, has proved 
the consistency (with the Zermelo-Fraenkel axioms of set theory) of the assumption that 
all sets are ‘ordinal definable’ and that 2 N ° is much greater than Hi.] 






Introductory note to 1947 and 1964 


1. Introduction 

Cantor’s continuum problem served as one of the principal and peri¬ 
odic foci for Godel’s research from 1935 until his death more than four 
decades later. His article 1947 (substantially revised and expanded to 
become 1964) originated from a request, made in 1945 by the editor of 
the American mathematical monthly, for a paper on the continuum prob¬ 
lem. The result was an expository article written in the style for which 
the Monthly is well known, but having a flavor that reflected Godel’s 
distinctive blend of mathematical and philosophical interests. Although 
194 7 contains no new technical results, it gives considerable insight into 
his philosophical views on set theory and on what would and would not, 
in his opinion, constitute a solution to the continuum problem. In one 
sense, 1947 can be regarded as a continuation, and as a variation in 
a different key, of his reflections in 1944 on Russell and mathematical 
logic. Like 1944 , the article 7 originated from a request for a contri¬ 
bution by Godel, and included both technical hints for possible future 
research in mathematics and cogent philosophical arguments in favor 
of Platonism. But 1947, unlike 1944 , was expository (indeed, the only 
expository article that Godel ever published) and concerned a specific 
mathematical problem rather than a philosopher’s contribution to logic. 

This introductory note has seven sections, which serve different pur¬ 
poses. Section 2 places 1947 in a historical context by tracing the con¬ 
tinuum problem from its origins to Godel’s attempts (circa 1938-1942) 
to establish the independence of the continuum hypothesis, Section 3 
recounts the circumstances which led Godel to write 1947. The content 
of 1947 is analyzed in Section 4, while Section 5 indicates how Godel’s 
perspective changed in the revised version 1964 (and in his 1966 plans 
for a third version of the paper). Section 6 discusses the effect of recent 
mathematical developments on Godel’s claims in 1947 and 1964■ Fi¬ 
nally. Section 7 concerns his two unpublished articles on the continuum 
hypothesis, both written about 1970. 


2. Historical background to the continuum problem, 
including Godel’s work before 1947 


The continuum problem, which Cantor first posed in 1878, grew out 
of research that he began in 1873. At that time, in a letter to Dedekind, 
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Cantor posed the question whether the set R of real numbers can be 
put in one-to-one correspondence with the set N of natural numbers. 
Although Dedekind at first doubted the importance of this question, 
he was pleased when Cantor discovered a proof that such a correspon¬ 
dence cannot exist. In January 1874 Cantor posed a further question 
to Dedekind: Can a line segment be put in one-to-one correspondence 
with a square and its interior? Three years passed before Cantor suc¬ 
ceeded in showing that there exists such a correspondence between a line 
segment and n-dimensional space for any n. a At the end of the article 
(1878) detailing this proof, Cantor stated that every uncountable set of 
real numbers can be put in one-to-one correspondence with the set of 
all real numbers, i.e., that there is no cardinal number strictly between 
that of N and that of R. This proposition was the original form of the 
continuum hypothesis. Since there is no standard terminology for this 
form, we shall call it the weak continuum hypothesis. 

When in 1883 Cantor developed the notion of well-ordering and as¬ 
serted that every set can be well-ordered, he gave a second and more 
elegant form to this hypothesis: R has the same power as the set of 
countable ordinals. In his aleph notation of 1895 this can be stated as 
2 k ° = N 1; the form in which the continuum hypothesis ( CH) is now 
known. (It is easily seen that CH is equivalent to the conjunction of 
the weak continuum hypothesis and the proposition that R can be well- 
ordered.) Cantor himself never used the term “continuum hypothesis”; 
instead, in his 1882 correspondence with Dedekind, he referred to the 
weak continuum hypothesis as the “two-class theorem”. 

In 1883 Cantor began to generalize CH , asserting that the set of all 
real functions has the third infinite power; in his later notation, this 
stated that 2 Nl = &2- He never discussed any more general form of CH. 
perhaps because he saw no use for such a generalization. The general¬ 
ized continuum hypothesis ( GCH ), which states that 2 Nq = N a+1 for all 
ordinals a, was first formulated by Hausdorff (1908, pages 487, 494) and 
was given this name by Tarski (1925). 

Despite very intense research, especially during 1884, Cantor never 
succeeded in demonstrating CH. However, he obtained a special case of 
the weak continuum hypothesis: Every uncountable closed subset of R 
has the power of R (1884). For a while that year, during August and 
again during October, he believed that he had proved CH, and then, for 
a brief period in November, that he had refuted CH (Moore 1982, pages 
43-44). 

In August 1904 a Hungarian mathematician, J. Konig, also claimed 
to have disproved CH. This occurred in a lecture he gave at the Inter- 


l Se<! Noether and Cavailles 1937, pp. 12-13, 20-21, 25. 
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national Congress of Mathematicians at Heidelberg. However, the next 
day E. Zermelo found the gap in Konig’s argument. When revised for 
publication ( Konig 1905), Konig’s result was that the power of R can¬ 
not equal N a+W for any ordinal a. In the light of Hausdorff’s 1906-1908 
researches on cofinality, the result was extended to the following: 2^° 
cannot equal for any (3 of cofinality cu. In 1947 Godel observed that 
nothing beyond this was known about the cardinality of R. 

As F. Bernstein noted (1901, page 14), one line of research on the 
continuum problem consisted in trying to extend, to larger and larger 
classes of subsets of R, Cantor’s result that the weak continuum hypoth¬ 
esis holds for the closed subsets of R. The hierarchy soon used for this 
purpose was that of the Borel sets, introduced by E. Borel (1898) and 
first extended to transfinite levels by H. Lebesgue (1905). In 1903W. H. 
Young strengthened Cantor’s result by showing that every uncountable 
Gs subset of R has the power of R. A decade later Hausdorff succeeded 
in extending the result further, first to the GgaS sets (1914a) and then 
to the entire Borel hierarchy (1916). 

For the next two decades, almost all progress on CH had a close 
connection with N. Luzin and his students (such as P. S. Aleksandrov 
and M. Suslin), who together made up the Moscow school of function 
theorists. The school’s first result occurred when Aleksandrov (1916) 
obtained the above-mentioned theorem on the Borel hierarchy at the 
same time that Hausdorff did. In 1917 Luzin and Suslin extended the 
Borel hierarchy by introducing the analytic sets, the first level of what 
later became the projective hierarchy. Suslin established that the weak 
continuum hypothesis holds for the analytic sets, now called the X) sets, 
since every uncountable analytic set has a perfect subset. 13 Yet, as Godel 
observed in 1947 (page 517), progress stopped there; for it had not been 
shown that the weak continuum hypothesis holds for every II) set but 
only that an uncountable II) set has either the cardinality Mi or that of 
R—a result due to K. Kuratowski (1933, page 246). 

A second approach to the continuum problem was begun by 
Luzin (1914) and pursued vigorously in Poland by his collaborator W. 
Sierpinski. In this approach, various propositions were shown to be con¬ 
sequences of CH. By assuming CH as a hypothesis, set theorists gained 
knowledge about its strength and were able to settle various open prob¬ 
lems. Sierpinski, beginning in 1919, was especially concerned to find 
interesting propositions equivalent to CH. He summarized his results in 
a book, Hypothese du continu (1934), the source for the “paradoxical” 
consequences of CH that Godel cited in 1947. 


b Luzin 1917. For discussion of the projective hierarchy, as well as the definition 
of and II^ sets, see p. 13 above of the introductory note to 1938. 
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In 1923 D. Hilbert claimed that his recently developed proof theory 
could not only provide a foundation for mathematics but could even 
settle classical unsolved problems of set theory such as the continuum 
problem (1923, page 151). Three years later he published his attempt 
to sketch a proof, based on definability considerations, of what he called 
the “continuum theorem” (1926). This attempted proof of CH met with 
widespread skepticism, in particular from Fraenkel (1928) and from 
Luzin (1929). In 1935 Luzin returned to this question, arguing that 
there was not in fact one continuum hypothesis but rather several con¬ 
tinuum hypotheses; he dubbed as the “second continuum hypothesis” 
the following proposition contradicting CH: 

2 n ° = 2 1 * 1 . 

Finally, he argued that the second continuum hypothesis accorded with 
a proposition (contradicting CH) of whose truth he felt certain: Every 
subset of R having power Mi is a II| set (1935, pages 129-131). Godel 
referred in passing to these matters (1947, page 523) while mentioning 
that Luzin, like Godel himself, believed CH to be false. 

In the absence of a proof or refutation of CH, mathematicians could 
try to establish its undecidability on the basis of the accepted axioms of 
set theory. As early as 1923, T. Skolem conjectured that CH cannot be 
settled by Zermelo’s 1908 axiom system (Skolem 1923a, page 229). But, 
when Skolem wrote, the understanding of models of set theory was still 
very rudimentary. Luzin hoped that Hilbert’s proof theory would supply 
a consistency proof for the “second” continuum hypothesis as well as for 
CH (Luzin 1935, pages 129-131). 

During the 1920s it was also uncertain whether models of set theory 
should be studied within second-order logic, as did Fraenkel (1922a) 
and Zermelo (1929, 1930), or within first-order logic, as Skolem pro¬ 
posed (1923a, 1930). In 1930 Zermelo showed that all second-order 
models of Zermelo-Fraenkel set theory (ZF) consist of the ath stage of 
the cumulative type hierarchy, where a is a strongly inaccessible ordi¬ 
nal. In an unpublished report of about 1930 to the Emergency Society 
of German Science, Zermelo pointed out that CH is either true in all of 
these models or false in all of them, so that in either case CH is decided 
in second-order ZF. C This result contrasts with the later discoveries of 


c This report is printed in Moore 1980, pp. 130-134, and the observation on CH 
can be found on p. 134. Kreisel (1967a, pp. 99-100) also emphasized this point, 
though unaware that Zermelo had formulated it almost four decades earlier; however, 
L. Kalmar (1967, p. 104) and A. Mostowski (1967a, p. 107) reacted negatively to 
Kreisel’s observation, and the second-order version of CH has been little studied. 
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Godel and P. J. Cohen that CH is undecided in the first-order version 
of ZF. 

About 1935 Godel realized that if Zermelo’s cumulative hierarchy 
were restricted at each level to the sets first-order definable from those 
obtained at previous levels, then one would have a class model of first- 
order ZF in which various important propositions held. Originally, in 
1935, he proved only that the axiom of choice is such a proposition, but 
by 1937 he had shown that GCH holds in the model as well. In 1938 
he was inclined to accept the axiom of constructibility as true, referring 
to it as “a natural completion of the axioms of set theory” (page 557), 
and hence to believe that the generalized continuum hypothesis is also 
true. Yet Godel refrained, for more than a year, from publishing an 
announcement of these relative consistency results. A clue to his silence 
can be found in his letter, written in December 1937 to Karl Menger, 
which reveals Godel’s hopes for an even stronger result about CH: 

I continued my work on the continuum problem last summer, and 
I finally succeeded in proving the consistency of the continuum hy¬ 
pothesis (even in the generalized form 2 N " = N a+ i) with respect 
to general set theory. But I ask you, for the time being, please not 
to tell anyone about this. So far, except for you, I have communi¬ 
cated this result only to von Neumann .... Right now I am also 
trying to prove the independence of the continuum hypothesis, but 
do not yet know whether I will succeed with it .... 

Unfortunately, Godel did not succeed in proving the independence of 
CH, despite repeated attempts. 

On the other hand, Godel’s efforts to show the independence of the 
axiom of choice, and consequently of the axiom of constructibility as 
well, were more fruitful. When Cohen received the Fields Medal for 
establishing the independence of CH, A. Church pointed out, in his 
speech awarding the medal (1968, page 17), that 

Godel... in 1942 found a proof of the independence of the axiom of 
constructibility in [finite] type theory. According to his own state¬ 
ment (in a private communication), he believed that this could be 
extended to an independence proof of the axiom of choice; but due 
to a shifting of his interests toward philosophy, he soon afterward 
ceased to work in this area, without having settled its main prob¬ 
lems. The partial result mentioned was never worked out in full 
detail or put into form for publication. 

Godel also commented on his independence results in a letter of 1967 to 
W. Rautenberg, who had written to Godel inquiring about Mostowski’s 
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claim that Godel, about 1940, had obtained most of Cohen’s indepen¬ 
dence results. In his reply (written in German and translated here), 
Godel confirmed what Church had stated: 

In reply to your inquiry I would like to refer to the presentation 
of the facts that Professor Alonzo Church gave in his lecture at the 
last International Congress of Mathematicians. 

Mostowski’s assertion is incorrect insofar as I was merely in 
possession of certain partial results, namely, of proofs for the in¬ 
dependence of the axiom of constructibility and of the axiom of 
choice in type theory. Because of my highly incomplete records 
from that time (i.e., 1942) I can only reconstruct the first of these 
two proofs without difficulty. My method had a very close connec¬ 
tion with that recently developed by Dana Scott [Boolean-valued 
models] and had less connection with Cohen’s method. 

I never obtained a proof for the independence of the continuum 
hypothesis from the axiom of choice, and I found it very doubtful 
that the method that I used would lead to such a result. 

Thus there can be no doubt that Godel believed that he had obtained 
some significant independence results, but not for CH. 

By the time that Godel composed 1947 he had become convinced, 
contrary to the views he expressed in 1938 , that CH (and hence the 
axiom of constructibility as well) was false. 


3. The origins of Godel 1947 

Godel undertook to write the article 1947 at the request of Lester R. 
Ford, the editor of the American mathematical monthly. “For some time 
we have been running a series of papers ...”, Ford wrote Godel on 30 
November 1945, 

which we call the “What Is?” series. In these papers the authors 
have presented some small aspect of higher mathematics in as sim¬ 
ple, elementary and popular a way as they possibly can. We have 
had papers by both Birkhoffs, Morse, Kline, Wilder and several 
others. 

I am writing this to ask if you would like to prepare such a pa¬ 
per. The subject would be of your own choosing, but I had thought 
of “What is the problem of [the] continuum?” 


When Godel did not respond, Ford wrote again on 31 January 1946. On 
14 February, Godel, who had not received the earlier letter, expressed 
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his willingness to consider the matter, adding that “in any case I could 
not write the paper immediately, because I am unfortunately very busy 
with other things at present.” A week later, Ford replied: “Let me know 
as promptly as you can whether you can write this paper. I ought to 
have it by the month of July. It will not be a long paper and its writing 
ought not to take a great deal of time.” 

Ford did not realize that, when composing an article, Godel was an 
extreme perfectionist. Another year passed before Godel completed the 
paper that, in March 1946, he agreed to write. On 13 August 1946 Ford 
inquired about the paper, since he wished to print it before his editor¬ 
ship ended in December. Godel answered on 31 August: “The paper 
about the continuum problem ... was finished and typewritten a few 
weeks ago, but on rereading it, I found some insertions desirable, which 
I have now about completed.” Once again, this was not to be. 

Finally, on 29 May 1947, Godel sent the paper to the new editor, 
C. V. Newsom. In his covering letter, Godel mentioned that he had 
“inserted a great number of footnotes whose order does not completely 
agree with the order in which they occur in the text.” He suggested that 
the new footnotes be printed after the text of the article. Unfortunately, 
as Godel learned when he saw the article in print, the footnotes had 
been renumbered in page proof without changing the internal references 
to them. d He had received no page proofs, having returned his galley 
proofs at the last moment. Newsom apologized for the errors, which 
occurred when the compositor tried to make sense of the footnotes, and 
added by way of compensation: “Your paper has brought many compli¬ 
ments; it is by far the best article in volume 54.” 


4. How Godel viewed the continuum problem in 1947 

Godel’s essay 1947 consists of four sections: (1) a discussion of the 
notion of cardinal number, (2) a survey of the known results about the 
power 2 n ° of the continuum R, (3) a philosophical analysis of set theory, 
and (4) a proposal for solving the continuum problem. 

In Section 1, Godel stressed that Cantor’s notion of cardinal number 
is unique, provided one accepts the minimal requirement that if two sets 
have the same cardinal number, then there exists a one-to-one corre¬ 
spondence between them. Here Godel did not discuss how the notion of 
cardinal number might be defined, contenting himself with the definition 


d These errors, which Godel noted in volume 55 of the Monthly , are corrected in 
the text of 194 7 printed in the present volume. 
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of equality between cardinal numbers. In this context he introduced the 
continuum problem as the question of how many points there are on a 
Euclidean straight line (or equivalently, how many sets of integers ex¬ 
ist). This problem would lack meaning, he observed, if there were not 
a “natural” representation for the infinite cardinal numbers. But since 
the alephs provide such a representation and since, by the axiom of 
choice, the cardinal number of every set is an aleph, it follows that the 
continuum problem is meaningful. In footnote 2 he defended such uses 
of the axiom of choice by arguing, on the one hand, that this axiom is 
consistent relative to the usual axioms for set theory (as shown in his 
1940); on the other hand, he asserted that the axiom of choice is quite 
as self-evident as the usual axioms for the notion of arbitrary set and is 
even provable for “sets in the sense of extensions of definable properties” 
(that is, for the constructible sets, as well as for the ordinal-definable 
sets of his 1946). 

In Section 2, Godel reformulated the continuum problem as the ques¬ 
tion: 


Which is the cardinal number of R? 

He noted that Cantor had conjectured CH as an answer. But he did not 
mention that Cantor not only conjectured the truth of CH but also, on 
numerous occasions, claimed in print to have proved CH. (In fact, many 
mathematicians took CH as true during the 1880s and 1890s.) Nor did 
Godel distinguish between CH and the weak continuum hypothesis, re¬ 
garding them as equivalent since he assumed the axiom of choice. Later 
researchers, however, would find it necessary to distinguish carefully be¬ 
tween CH and the weak continuum hypothesis when they attempted to 
solve the continuum problem (especially when the axiom of determinacy 
was involved; cf. Section 6 below). 

Godel stressed how little was known about the power 2 N ° of R, despite 
the many years that had passed since Cantor formulated CH. Indeed, 
Godel remarked that only two facts were known: (a) 2 K ° does not have 
cofinality w and (b) the weak continuum hypothesis holds for the X} 
sets (the analytic sets), which, however, are only a tiny fraction of all 
the subsets of R. In particular, he added, it was not known whether: 

(i) There is some given aleph that is an upper bound for 2 N °, 

(ii) 2*° is accessible or is weakly inaccessible, 

(iii) 2 n ° is singular or regular, 

(iv) 2 n ° has any restrictions on its cofinality other than Konig’s result 
that its cofinality is uncountable. 

What was known, he continued, was merely a large number of proposi- 
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tions that follow from CH as well as several propositions that are equiv¬ 
alent to it. e 

Godel observed that our ignorance about the power of the continuum 
was part of a greater ignorance about infinite cardinal products. In 
particular, the power of the continuum, 2 N °, is the simplest non-trivial 
cardinal product, namely, the product of Ny copies of 2. He added that 
it was not even known whether 

(v) there is some given cardinal that is an upper bound for some 
infinite product of cardinals greater than 1. 

All that was known were certain lower bounds on infinite products, such 
as Cantor’s theorem that the product of Ho copies of 2 is greater than 
N 0 and the Zermelo-Konig theorem that if m a < n Q for all a in some 
given set I, then 

^ 11 n,,. 

a el ael 

Thus it was not even known whether the product of No copies of 2 is less 
than the product of Ni copies of 2, that is, whether 

2«o <■ 2 n i . 

In Section 3 Godel argued that this lack of knowledge was not due 
entirely to a failure to find the appropriate proofs, but stemmed from 
the fact that the concept of set required “a more profound [conceptual] 
analysis ... than mathematics is accustomed to give” (page 518). He be¬ 
gan his philosophical analysis of this concept by rejecting intuitionism, 
because it is destructive of set theory, and by laying aside the semi- 
intuitionistic viewpoints of Poincare and Weyl for the same reason. In¬ 
stead, he insisted that axiomatic set theory provides the proper founda¬ 
tion for Cantorian set theory. Protecting himself against the objection 
that the paradoxes threaten set theory, he asserted that no paradox has 
ever emerged for the iterated notion of “set of” (the cumulative type 
hierarchy V^). f Here Godel permitted a set of urelements (the integers, 
for example) as the basis from which the cumulative hierarchy is built 
up; incidentally, this corroborates the view that he adopted the cumu¬ 
lative hierarchy from Zermelo 1930. Finally, Godel insisted that the 
continuum problem—if formulated in a combinatorial way as the ques¬ 
tion whether CH can be deduced from the axioms of set theory—retains 


e What is now known about (i)-(iv) is discussed in Section 6 below. 
f The cumulative hierarchy V a is also called R(a). On this hierarchy, see p. 4 
above of the introductory note to 1938. 
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a meaning, independently of one’s philosophical standpoint, even for the 
most extreme intuitionist. 

If the usual axioms of set theory are consistent, Godel remarked, then 
CH is either provable, disprovable, or undecidable. After noting that his 
1940 ruled out the second possibility, he asserted that the third one is 
probably correct. To attempt to establish that CH is undecidable, he 
insisted, was the most promising way of attacking the problem.® 

What is especially important, however, is this: Although Godel ar¬ 
gued that CH is almost certainly independent from ZF (as formulated 
in first-order logic), he insisted strongly that a proof of its independence 
would not solve the continuum problem. Indeed, he emphasized, as Zer- 
melo (1930) had done, that “the axioms of set theory by no means form 
a system closed in itself, but, quite on the contrary, the very concept of 
set on which they are based [the cumulative hierarchy] suggests their 
extension by new axioms which assert the existence of still further itera¬ 
tions of the operation ‘set of’ "(1947, page 520). Consequently, he urged 
mathematicians to search for new large cardinal axioms which would, 
he hoped, decide CH. He added, with his incompleteness theorems in 
mind, that such axioms would settle questions about Diophantine equa¬ 
tions undecidable by the usual axioms. 

Here Godel’s strongly held Platonism was visible, as it had been in 
1944 and as it would be even more strongly in 1964 • If the undecidabil¬ 
ity of Cantor’s conjecture CH were established, he stressed, this would 
not settle the continuum problem—for essentially philosophical reasons. 
In fact, he wrote (1947, page 520), 

only someone who (like the intuitionist) denies that the concepts 
and axioms of classical set theory have any meaning (or any well- 
defined meaning) could be satisfied with such a solution, not some¬ 
one who believes them to describe some well-determined reality. 
For in this reality Cantor’s conjecture must be either true or false, 
and its undecidability from the axioms as known today can only 
mean that these axioms do not contain a complete description of 
this reality. 

After granting that all large cardinal axioms known at the time failed 
to settle CH, since all of them were consistent with the axiom of con- 
structibility, Godel made an eloquent plea for new axioms (1947, page 
521): 


s On the other hand, Godel did not mention that in 1923a Skolem had also argued 
for the independence of CH, nor that he himself had worked intensively at establishing 
its independence during 1942 (as his Arbeitshefte attest). 
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Even disregarding the intrinsic necessity of some new axiom, and 
even in case it had no intrinsic necessity at all, a decision about its 
truth is possible also ... inductively by studying ... its fruitfulness 
in consequences and in particular in ... consequences demonstra¬ 
ble without the new axiom, whose proofs by means of the new 
axiom, however, are considerably simpler and easier to discover, 
and make it possible to condense into one proof many different 
proofs_ There might exist axioms so abundant in their verifi¬ 

able consequences, shedding so much light upon a whole discipline, 
and furnishing such powerful methods for solving given problems 
(and even solving them, as far as that is possible, in a construc- 
tivistic way) that quite irrespective of their intrinsic necessity they 
would have to be assumed at least in the same sense as any well- 
established physical theory. 

This allusion to physics illustrates his view (already stated in 1944 > 
page 137) that the assumption of an underlying reality is as “necessary 
to obtain a satisfactory theory of mathematics” as the assumption of 
the reality of physical objects is “necessary for a satisfactory theory of 
our sense perceptions”. 

In Section 4, Godel returned to his conjecture that CH is not decided 
by the usual axioms for set theory, arguing that there were at least two 
reasons for expecting such undecidability. The first was that there exist 
two quite different classes satisfying the usual axioms: the class of con- 
structible sets and the class of “sets in the sense of arbitrary multitudes” 
(page 521). Thus he believed that one could not expect CH to be settled 
if one did not specify axiomatically which of these two classes was being 
considered. (He did not mention here, perhaps for philosophical reasons, 
a third such class, namely the class of ordinal-definable sets, to which 
he alluded in footnote 26. h ) Half of his conjecture about undecidability 
had already been verified, namely the relative consistency of CH with 
the usual axioms, since CH is true in the class of constructive sets. 

Godel then made the important suggestion that “from an axiom in 
some sense directly opposite to this [axiom of constructibility] the nega¬ 
tion of Cantor’s conjecture [C77] could perhaps be derived” (page 522). 
The difficulty, of course, with Godel’s suggestion resides in the phrase 
“directly opposite”, since he himself rightly believed that the mere nega- 


h At first glance it might appear that in footnote 20 he conflated the class of 
ordinal-definable sets, introduced in 1946, with the class of constructible sets. How¬ 
ever, by comparing footnote 20 with footnote 26, one sees that in the earlier footnote 
he had in mind the constructible sets and, in the latter, the ordinal-definable sets. 
Likewise, in footnote 21 of 1964 he meant the constructible sets rather than the 
ordinal-definable ones. 
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tion of the axiom of constructibility would not suffice for this purpose 
(see Section 5). Yet insofar as the axiom of constructibility is a min¬ 
imality axiom (expressing that the power set of a set, and hence the 
universe, is as small as possible), he may have had in mind here some 
kind of maximality axiom, as he certainly did in 1964 (see pages 167-168 
below). 

Godel’s second reason for expecting the independence of CH was that 
CH has certain “paradoxical” consequences which he found unlikely to 
be true—in particular, the existence of certain very thin subsets of R 
that have the power 2 H °. The first effect of CH was to ensure that some 
kinds of thin subsets of R, proved in ZFC to have instances that are 
uncountable, can actually have the power 2 K °. Examples of such sets 
are 

(1) sets of first category on every perfect subset of R, 
and 

(2) sets carried into a set of measure zero by every continuous one- 
to-one mapping of R onto itself. 

The second effect of CH was to imply that certain kinds of thin sub¬ 
sets of R can have the power 2**° even though, in ZFC, no instances of 
these kinds are known that are uncountable. Here he gave as an example 
the sets of absolute measure zero (by definition, such a set is coverable 
by a given sequence of intervals of arbitrarily small positive lengths). 
He then gave several other examples, such as a subset of R including no 
uncountable set of measure zero. 1 

Godel attempted to protect himself against the rejoinder that many 
kinds of point-sets obtained without CH (such as a Peano curve) are 
highly counterintuitive. In these cases, he argued, the implausibility of 
the point-sets was due to “a lack of agreement between our intuitive 
geometrical concepts and the set-theoretical ones occurring in the theo¬ 
rems” (page 524). 

Nevertheless, there appears to be little evidence that analysts and 
set theorists now regard as “paradoxical” the kinds of thin sets cited by 
Godel. For example, P. J. Cohen, when asked his opinion of these thin 
sets of power 2 N °, was not troubled by them.l Likewise, in an article sur¬ 
veying recent work on CH , D. A. Martin responded negatively to Godel’s 
claim: “While Godel’s intuitions should never be taken lightly, it is very 
hard to see that the situation [with CH\ is different from that of Peano 
curves, and it is even hard for some of us to see why the examples Godel 
cites are implausible at all” (1976, page 87). 

In the conclusion to his article, Godel insisted that “it is very suspi- 


‘This particular example, however, was dropped in his 1964 version of the article. 
J Personal communication from P. J. Cohen, April 1984. 
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cious that, as against the numerous plausible propositions which imply 
the negation of the continuum hypothesis, not one plausible proposition 
is known which would imply the continuum hypothesis” (1947, page 
524). What are these “numerous plausible propositions”? We cannot be 
certain, since Godel did not mention even one of them explicitly. Per¬ 
haps he simply intended such propositions to be the negations of those 
that he had called “paradoxical”. In any case, here he was uncharac¬ 
teristically incautious in his assertion. In 1970 he himself would find a 
proposition, which he then regarded as quite plausible, that implies CH 
(see Section 7). 


5. Godel’s altered perspective in 1964 

The article 1964 resulted from a request, made to Godel by P. 
Benacerraf and H. Putnam, for permission to reprint both of the essays 
1944 and 1947 in their forthcoming source book Philosophy of mathe¬ 
matics'. Selected readings. At first, Godel hesitated to grant permission, 
fearing that the introduction to their book would subject his article 
to positivistic attacks. He asked Benacerraf, in conversation, for what 
amounted to editorial control of the editor’s introduction to the source 
book. As an alternative, since such control could not be granted, Be¬ 
nacerraf assured Godel that he would be shown the introduction and, 
furthermore, that the editors did not intend it to make a major philo¬ 
sophical statement but rather to outline the issues. Thus placated, Godel 
gave permission to reprint his two essays, and began extensively revising 
1947. Benacerraf met with Godel a number of times to go over the revi¬ 
sions, since Godel felt that he did not know English “well enough”. Yet 
Benacerraf knew no one with a more subtle grasp of the various ways 
in which an English text could be interpreted. While considering the 
proposed changes, Godel repeatedly pointed out to Benacerraf various 
of their unwanted consequences. 15 

Whereas Godel made no substantive modifications in reprinting 1944, 
merely adding an initial footnote, he introduced more than one hundred 
separate alterations in 1947 in the course of preparing 1964 • Most of 
these changes were stylistic and reflected his increasing acquaintance 
with the nuances of the English language. In particular, a number of 
long and rather Germanic sentences were divided into shorter and more 
idiomatic ones. 


‘Personal communications from P. Benacerraf, July 1982 and March 1986. 
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Nevertheless, a substantial number of his changes were more than 
stylistic. A minor example is his reference in 1947to & “natural” repre¬ 
sentation of the infinite cardinal numbers (the alephs), replaced in 1964 
with a reference to a “systematic” representation. Far more surprising is 
his omission in 1964 of all reference to the ordinal-definable sets, which 
in 194 7 he had discussed on page 522 and in footnote 26. It is uncertain 
what prompted him to omit this notion of set that he had introduced in 
his 1946. 

In Godel’s Nachlass there exist two drafts of his 1964, each an offprint 
of 194 7 with alterations written on it. The second of these contains a 
revision, not incorporated into 1964 , that credits Zermelo (1930) with 
“substantially the same solution of the paradoxes” as is embodied in the 
cumulative type hierarchy, which Godel designates by his notion “set 
of”. Again, it is unknown why he intended to credit Zermelo and then 
decided not to do so. 

One particularly important addition occurred in footnote 20 of 1964, 
where large cardinal axioms were discussed. Here he remarked that D. 
Scott (1961) had proved that the existence of a measurable cardinal 
contradicts the axiom of constructibility—in contrast to earlier large 
cardinal axioms, such as those of Mahlo (1911, 1913), which are con¬ 
sistent with that axiom. Consequently, he continued, the relative con¬ 
sistency proof for CH by means of the class of constructible sets fails if 
one assumes that there is a measurable cardinal. (In 1971a, however, 
J. Silver established that GCH holds in the class of sets constructible 
from a countably additive measure on the least measurable cardinal. In 
1967, Levy and Solovay had already shown, by means of forcing, that 
CH is relatively consistent with a measurable cardinal; see footnote p 
below.) Godel then added that it was not yet certain whether “the gen¬ 
eral concept of set” implies the existence of a measurable cardinal in the 
same way as it implies Mahlo’s axioms. By contrast with this uncer¬ 
tainty, in Gddel’s unpublished revision of September 1966 he argued for 
the existence of a measurable cardinal since this follows “from the ex¬ 
istence of generalizations of Stone’s representation theorem to Boolean 
algebras with operations on infinitely many elements” (page 261 below).’ 

Another noteworthy addition occurred in footnote 23 of 1964- 
Whereas in the 1947 version of this footnote, Godel had argued that 
CH might be decided by means of some axiom diametrically opposite to 
the axiom of constructibility, in 1964 he spelled out what he meant: 


'See also Godel’s oral comments about measurable cardinals to Solovay on p. 19 
above. 
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I am thinking of an axiom which (similar to Hilbert’s completeness 
axiom in geometry) would state some maximum property of the 
system of all sets, whereas axiom A [the axiom of constructibility] 
states a minimum property. Note that only a maximum property 
would seem to harmonize with the concept of set explained in foot¬ 
note 14 [arbitrary sets of the cumulative type hierarchy]]. 

Hilbert’s axiom of completeness {1902), which belongs to second-order 
logic, had characterized Euclidean geometry (and, analogously, the real 
numbers) as the maximal structure satisfying his other axioms. What 
Godel proposed for set theory was vague but suggestive; in particular, 
the various large cardinal axioms can be regarded as steps in the di¬ 
rection of maximality. His meaning is made more definite by a letter 
he wrote to S. Ulam (quoted in Ulam 1958, page 13) apropos of von 
Neumann’s axiom ( 1925 ) that a class 5 is a proper class if and only if 
S is equipotent with the class V of all sets: 

The great interest which this axiom has lies in the fact that it 
is a maximum principle, somewhat similar to Hilbert’s axiom of 
completeness in geometry. For, roughly speaking, it says that any 
set which does not, in a certain well-defined way, imply an incon¬ 
sistency exists. Its being a maximum principle also explains the 
fact that this axiom implies the axiom of choice. I believe that the 
basic problems of abstract set theory, such as Cantor’s continuum 
problem, will be solved satisfactorily only with the help of stronger 
axioms of this kind, which in a sense are opposite or complementary 
to the constructivistic interpretation of mathematics. 

More recent attempts to formulate such a maximum principle have 
not been completely successful. J. Friedman {1971) proposed one such 
proposition, called the generalized maximization principle, and showed 
it to be equivalent to GCH ; thus far it has attracted little attention. Re¬ 
cently, S. Shelah’s strong version of his proper forcing axiom, PFA+ (by 
which, in 1982 , he generalized Martin’s axiom in the direction of maxi¬ 
mality), and the principle dubbed “Martin’s maximum” by M. Foreman, 
M. Magidor and Shelah have each been shown (by them in 198?, and 
independently by S. Todorcevic) to imply that 2 N ° = M 2 ; more recently, 
Todorcevic has announced a proof that 2 N ° = N 2 already follows from 
PFA. At present, there is no consensus among set theorists as to the 
truth of these hypotheses. Nor does the author wish to conjecture what 
Godel would have thought of them. 

By far the most substantial alteration in Godel 1964 was the addition 
of a long supplement, together with a brief postscript noting that Cohen 
{1963, 1964) had just established the independence of CH and thereby 



Note to 1947 and 1964 


169 


had verified Godel’s 1947 claim that CH would not be settled by the 
usual axioms for set theory. The supplement consists of a discussion of 
new results that Godel considered important, along with an extended 
philosophical defense of his Platonist position on CH. 

Ostensibly, this defense was stimulated by A. Errera’s article 1952, 
claiming that if CH is not decided by the usual axioms for set the¬ 
ory, then the question whether CH is true will lose its meaning, just as 
happened to the parallel postulate when non-Euclidean geometry was 
proved consistent. Gddel insisted that, on the contrary, “the situation 
in set theory is very different from that in geometry, both from the 
mathematical and from the epistemological point of view” {1964, P a g e 
270). Here he stressed the asymmetry between assuming that there is, 
and assuming that there is not, a strongly inaccessible cardinal. The 
former assumption was fruitful in the sense of having consequences for 
number theory, while the latter was not. Likewise, he continued, CH 
“can be shown to be sterile for number theory ..., whereas for some 
other assumption about the power of the continuum this perhaps is not 
so” (page 271). This “sterility”, for first-order number theory, was due 
to the fact that N is absolute for L, the class of all constructible sets. 
(In his revisions of 1966-1967, discussed below, he here replaced CH by 
GCH, and “power of the continuum” by “power of 2 K “”.) 

By using later results, we can say more. In 1969 R. A. Platek estab¬ 
lished that if a sentence of second-order number theory is provable from 
CH, then it is already provable from the usual axioms of set theory along 
with the axiom of choice; moreover, he showed that the same holds for 
any III sentence of third-order number theory. 1 " (No further extension 
was possible, since CH itself is a Ef sentence.) By 1965 Solovay had 
independently found Platek’s result on CH, and in addition had discov¬ 
ered a corresponding result for not- CH : If a n| sentence of second-order 
number theory is provable from not- CH, then it is already provable from 
ZF and the axiom of choice." In this sense, then, both CH and not -CH 
are sterile for number theory. 

The Platonist views put forward by Godel in 1947were strengthened 
in 1964, not only in the supplement but in the text as well, where he 
described himself as “someone who considers mathematical objects to 
exist independently of our constructions” (page 262). Nevertheless, his 
Platonism was most visible in the supplement, where on page 271 he 
pursued at some length the analogy between mathematics and physical 
theories that he had already broached in 1947: 


m S. Kripke and J. Silver had each independently arrived at the same result (Platek 
1969, p. 219). 

"Personal communication from R. M. Solovay, 27 October 1984. 
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Despite their remoteness from sense experience, we do have some¬ 
thing like a perception also of the objects of set theory, as is seen 
from the fact that the axioms force themselves upon us as being 
true. I don’t see any reason why we should have less confidence 
in this kind of perception, i.e., in mathematical intuition, than in 
sense perception, which induces us to build up physical theories 
and to expect that future sense perceptions will agree with them, 
and, moreover, to believe that a question not decidable now has 
meaning and may be decided in the future. 

In September 1966, Godel wrote an addendum called “Changes to be 
made in 3rd edition”, anticipating that 1964 would be reprinted. 0 Al¬ 
ready in the postscript to 1964, which was added when 1964 was almost 
in press, Godel had mentioned Cohen’s 1963 proof of the independence of 
CH. But in the 1966 addendum Godel expressed himself more strongly: 
“Cohen’s work ... is the greatest advance in the foundations of set the¬ 
ory since its axiomatization”. He added that Cohen’s forcing “has been 
used to settle several other important independence questions”; yet he 
mentioned only one result, namely, that all known large cardinal axioms 
“are not sufficient to answer the question of the truth or falsehood of 
Cantor’s continuum hypothesis” (page 270 below). Although he did not 
give a reference, he was almost certainly referring to the result of Levy 
and Solovay that, for all known large cardinals k (and in particular for 
measurable cardinals), if there is a model of set theory containing k, 
then there is a model containing k in which CH is true and another 
model containing k in which CH is false. p 

6. Later research affecting 1947 and 1964 

There were two major developments that affected Godel’s program, 
as proposed in 1947 and 1964, for settling CH. The first of these was re¬ 
search on large cardinals, and the second consisted of new independence 
results obtained by Cohen’s method of forcing. In fact, there has been 
an extremely fruitful interaction, which still continues, between these 
two lines of development. 


“These changes have been incorporated into the text of 1964 in the present vol¬ 
ume, where they are printed in square brackets. Godel made additional changes in 
a manuscript of October 1967. The textual notes record the exact changes to 1964 
made in 1966 and 1967. On the other hand, the reprinting of 1964 in Benacerraf and 
Putnam 1983 does not include these alterations and additions. 

p This result, announced in Levy 1964 an <i independently in Solovay 1965a, was 
proved in detail in Levy and Solovay 1967. 
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The question of the relationship of CH to large cardinal axioms, and 
to new axioms such as the axiom of determinacy (AD), has turned out 
to be unexpectedly complicated. Large cardinal axioms are now known 
to affect the class of sets for which the weak continuum hypothesis is 
true. In particular, Solovay showed (1969) that if there exists a measur¬ 
able cardinal, then the weak continuum hypothesis is true for sets. 
Moreover, AD, which may be regarded as a kind of large cardinal ax¬ 
iom, implies that the weak continuum hypothesis holds for every subset 
of R. Unfortunately, AD contradicts CH, since it implies that the real 
numbers cannot be well-ordered (Mycielski 1964, page 209), and so was 
surely unacceptable to Godel as a solution to the continuum problem. 
On the other hand, the axiom of projective determinacy (that is, AD 
restricted to the projective sets) is also a kind of large cardinal axiom 
and has recently been shown to be consistent with the axiom of choice, 
provided a sufficiently large cardinal exists. Indeed, D.A. Martin and 
J.R. Steel (198?) have recently established, among other things, that 
if there is a supercompact cardinal (or, what is weaker, infinitely many 
Woodin cardinals), then projective determinacy is true and hence the 
weak continuum hypothesis is true for all projective sets. q 

The second line of development, independence proofs, profoundly af¬ 
fected Godel’s program. In 1963 Cohen established not only that CH is 
independent but also that 2 H ° can be arbitrarily large among the alephs. 
Feferman then showed that it is consistent with ZF to have 2 N ° = 2 Nl , 
Luzin’s second continuum hypothesis (Cohen 1964, page 110). From 
Cohen’s work it followed, in regard to (i)-(iv) on page 161 above, that 
2 h ° is not bounded above by any given aleph and can be either acces¬ 
sible or weakly inaccessible, singular or regular; moreover, there are no 
restrictions on the cofinality of 2 N ° other than Konig’s theorem. Solo¬ 
vay independently determined the a for which 2 H ° = is consistent, 
namely all of uncountable cofinality (1965). Thus it was shown that 
our ignorance regarding (i)-(iv) is inevitable if we assume only the usual 
first-order axioms of set theory. (In 1964 , Godel was inclined to believe 
that 2 h ° is rather large, and favored the proposition that 2 H ° is the first 
weakly inaccessible cardinal (1964, page 270).) 

Shortly after Cohen announced his results in 1963, research on the 
continuum problem turned to establishing what are the possibilities for 
the continuum function F(tt a ) = 2 Nq , defined on all ordinals. The first 
major breakthrough was Easton’s theorem (1964, 1970) that the contin¬ 
uum function F can, on regular cardinals, be any nondecreasing function 


q By combining this result with earlier work of Woodin, one obtains from a su¬ 
percompact cardinal the existence of a transitive class model of ZF + AD + DC 
containing all real numbers and all ordinals. 
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for which the cofinality of F(H a ) is greater than For a decade there 
was a consensus among set theorists that something analogous to Eas¬ 
ton’s result would also be shown for singular cardinals. Both Easton 
and Solovay, among others, attempted to solve what came to be called 
the singular cardinals problem. 

Consequently, set theorists were quite surprised in 1974 when Silver 
established that if GCH holds below a singular cardinal k of uncount¬ 
able cofinality, then it holds at k as well (Silver 1975). Even this result, 
however, by no means settled the singular cardinals problem—provided 
that this problem is taken as asking for all the laws about cardinal ex¬ 
ponentiation relative to singular cardinals. A first step occurred when 
Bukovsky (1965) proved that cardinal exponentiation is determined by 
the so-called gimel function —a result that Godel had stated but 
not proved in 1947 (page 517). 

One important spinoff of Silver’s result was Jensen’s covering theo¬ 
rem (Devlin and Jensen 1975), which states that if the large cardinal 
axiom asserting the existence of 0# is false, then the singular cardinals 
hypothesis is true/ This hypothesis asserts that the continuum function 
F(N a ) = 2 N “ is determined by its behavior at regular Thus, al¬ 
though known large cardinal axioms did not settle CH, the negation of 
a large cardinal axiom settled the behavior of the continuum function F 
at singular cardinals. 

Silver’s result was extended by Galvin and Hajnal (1975) for the case 
where k is a singular strong limit cardinal of uncountable cofinality. For 
such a «, they found an upper bound on 2 K in terms of the behavior of 2 A 
for a stationary set of A < k. Somewhat earlier, in 1974, Solovay proved 
that if k is strongly compact, then there is a proper class of cardinals for 
which GCH holds, namely, the class of singular strong limit cardinals 
greater than k. 

Magidor (1977) established that Silver’s assumption of uncountable 
cofinality is necessary. In particular, Magidor showed, using a very 
large cardinal, that if GCH holds below K w , then it may happen that 
2 *G = n w+2 . Shelah (1982) obtained a bound on 2 N “ under the as¬ 
sumption that GCH holds below Furthermore, Shelah discovered 
an analogue of the Galvin-Hajnal result for singular cardinals of count¬ 
able cofinality. Finally, using a large cardinal assumption, Foreman and 
Woodin found a model of ZFC in which GCH fails everywhere; Woodin 
later improved this to 2 N “ = H Q+ 2 for all a. (It is known, thanks to an 
earlier result of L. Patai, that if, for all a and for a fixed /?, 2 N “ = K a+/ a, 
then P is finite; see Jech 1978, pages 48 and 580.) 


r Concerning 0#, see p. 21 above of the introductory 


to 1938 . 
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Recently Foreman (1986) has proposed the axiom of resemblance, 
which he regards as a generalization of large cardinal axioms, and has 
announced that it implies both GCH and the axiom of projective de- 
terminacy. (He has shown in 1986 that, from CH and the axiom of 
resemblance, GCH follows.) For Godel, however, the fact that the ax¬ 
iom of resemblance implies GCH would probably have disqualified it as 
settling the continuum problem. s 


7. Godel’s unpublished papers on CH 

After his proposal for using large cardinal axioms to decide CH did 
not succeed, Godel introduced other axioms that he hoped would decide 
it. In January 1964, before he knew that such axioms, and in particular 
the existence of a measurable cardinal, did not settle CH, he wrote to 
Cohen about a related question: 

Once the continuum hypothesis is dropped, the key problem con¬ 
cerning the structure of the continuum, in my opinion, is the ques¬ 
tion of whether there exists a set of sequences of integers of power 
Ni which, for any given sequence of integers, contains one majoriz¬ 
ing it from a certain point on- 1 always suspected that, in 

contrast to the continuum hypothesis, this proposition is correct 
and perhaps even demonstrable from the axioms of set theory. 

Six years later, Godel postulated the existence of such a set of sequences 
as one of his axioms, now called Godel’s square axioms, which were in¬ 
tended to resolve the continuum problem. 

The square axioms are an axiom schema stating that, for each natural 
number n, there exists a scale, of type w n+ i, of functions from ui n to 
w„.‘ Perhaps Godel was led to formulate the square axioms by reading 
Borel 1898, which he cites. On page 116, Borel claimed that there exists 
a scale for the case n = 0 for all “effectively defined” functions, though 
he did not give a proof of his claim. 

Godel introduced these axioms in his final contribution to solving the 
continuum problem, a short paper written in 1970 and entitled “Some 
considerations leading to the probable conclusion that the true power of 
the continuum is N2”, which he intended to publish in the Proceedings 


3 A recent argument that 2 N< > > can be found in Freiling 1986. 

‘In other words, let F be the set of functions from ui n to u> rl ; then F has a subset 
S of power N„ + , such that for any function f in F there is some function g in S such 
that for some a and for all fl > «, J(fi) < g{f)). 
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of the National Academy of Sciences.' 1 In this paper he proposed four 
axioms (or axiom schemas), of which the square axioms were the first. 
The second axiom asserted that there are exactly K,„ initial segments of 
the scale given by the square axioms. The third axiom was that there 
exists a maximal scale of functions from N to R such that “every as¬ 
cending or descending sequence has cofinality c o". The fourth and final 
axiom consisted of the Hausdorff continuity axiom for the scale given by 
Godel’s third axiom. (Axiom 4 implies that 2**° = 2^ J .) 

Godel mailed his paper to Tarski, who then asked Solovay to examine 
its correctness. D. A. Martin, to whom Solovay had sent a copy of the 
paper, found that a result in it contradicted a theorem of Solovay’s. In 
particular, Martin observed, since Solovay had shown that the square 
axioms do not put an upper bound on the size of 2 K °, Godel had to 
be mistaken in his claim that these axioms yield the result that 2 N ° is 
bounded by H 2 . v On 19 May 1970 Tarski returned the paper to Godel, 
adding in his covering letter that “you will certainly hear still in this 
matter either from me or from somebody else in Berkeley.” 

The whole matter was tinged with irony. For by 1965, having be¬ 
come convinced of the proposition that the square axioms do put an 
upper bound on 2 K °, Godel discussed this proposition with Solovay at 
the Institute for Advanced Study. At Godel’s request Solovay looked 
into the matter and found that there are models of set theory satisfy¬ 
ing the square axioms but having 2**° arbitrarily large. Godel remained 
unconvinced, despite K. Prikry’s assurances that Solovay was correct. w 
Solovay’s result had to be rediscovered independently by E. Ellentuck 
(about 1973) before Godel came to accept it. x 

In 1970, not long after receiving Tarski’s letter, Godel drafted a sec¬ 
ond version of his paper on CH, entitled “A proof of Cantor’s continuum 
hypothesis from a highly plausible axiom about orders of growth”. His 
title represented a sudden and unexpected shift in his longstanding re¬ 
jection of CH. This change in attitude appears to have been due to his 
belief that the square “axioms for H n (or even any regular ordinal) are 
highly plausible, much more so than the continuum hypothesis.” Indeed, 
he claimed that CH follows from the square axiom for Hi (that is, for 
n = 1). In conclusion, he wrote: 


u The various versions of this paper are being considered for inclusion in Volume 
III of these Collected works. 

v Personal communication from D. A. Martin and R. M. Solovay, 13 February 
1984. 

w Personal communication from R. M. Solovay, 4 April 1984. 

x Ellentuck only learned of Solovay’s priority for this result after finding it himself; 
sec Ellcntuck’s note, dated February 1973, in Godcl’s Nachlass. 
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It seems to me this argument gives much more likelihood to the 
truth of Cantor’s continuum hypothesis than any counterargument 
set up to now gave to its falsehood, and it has at any rate the 
virtue of deriving the power of the set of all functions ca —> w 
from that of certain very special sets of these functions. Of course 
the argument can be applied to higher cases of the generalized 
continuum hypothesis (in particular to all H n ). It is, however, 
questionable whether the whole generalized continuum hypothesis 
follows. 

At the top of this second version Godel had written “nur fiir mich 
geschrieben” (“written only for myself”). It is unclear who, if anyone, 
saw this version before Godel’s death. 

A third version of the paper (so Godel described it) was a draft of a 
letter to Tarski, apparently never sent, that survives in Gddel’s Nach- 
lass. This letter is much closer in spirit to the first version of the paper 
than to the second. In the letter Godel stated that he had written the 
first version hurriedly right after an illness for which he had been taking 
medication. What he had proved, he now believed, “is a nice equivalence 
result for the generalized continuum hypothesis ... [showing that it] fol¬ 
lows from certain very special and weak cases of it.” Godel concluded 
the letter with some speculations: 

My conviction that 2 N ° = N 2 of course has been somewhat shaken. 
But it still seems plausible to me. One of my reasons is that I don’t 
believe in any kind of irrationality such as, e.g., random sequences 
in any absolute sense. Perhaps 2 N ° = K 2 does follow from my 
axioms 1-4, but unfortunately Axiom 4 is rather doubtful, while 
axioms 1-3 seem extremely likely to me. 

Yet he conceded that Axioms 1-3 do not imply 2 N ° < N 2 . y 

Thus ended Godel’s last attempt to settle the continuum problem, 
which he had analyzed so brilliantly in 1947 and 1964- z 

Gregory H. Moore 


y Before 1973, Godel’s square axioms were studied by G. Takeuti, who established 
that the existence of a scale from <n 1 to uiq implies OH. These axioms are also inves¬ 
tigated in Ellentuck 1975, Takeuti 1978 and P. E. Cohen 1979. 

H would like to thank S. Feferman for many substantive suggestions, and J. 
Dawson for many stylistic ones, to an earlier draft of this introductory note; Dawson 
has also been of considerable assistance on archival matters. I am especially grateful 
to R. M. Solovay for his many useful suggestions regarding Section 6. 





What is Cantor’s continuum problem? 

(mi) 

1. The concept of cardinal number 
Cantor’s continuum problem is simply the question: How many points 
are there on a straight line in Euclidean space? In other terms, the question 
is: How many different sets of integers do there exist? 

This question, of course, could arise only after the concept of “number” 
had been extended to infinite sets; hence it might be doubted if this exten¬ 
sion can be effected in a uniquely determined manner and if, therefore, the 
statement of the problem in the simple terms used above is justified. Closer 
examination, however, shows that Cantor’s definition of infinite numbers 
really has this character of uniqueness, and that in a very striking manner. 
For whatever “number” as applied to infinite sets may mean, we certainly 
want it to have the property that the number of objects belonging to some 
class does not change if, leaving the objects the same, one changes in any 
way whatsoever their properties or mutual relations (e.g., their colors or 
their distribution in space). From this, however, it follows at once that 
two sets (at least two sets of changeable objects of the space-time world) 
will have the same cardinal number if their elements can be brought into 
a one-to-one correspondence, which is Cantor’s definition of equality be¬ 
tween numbers. For if there exists such a correspondence for two sets A 
and B it is possible (at least theoretically) to change the properties and 
relations of each element of A into those of the corresponding element of 
B, whereby A is transformed into a set completely indistinguishable from 
B, hence of the same cardinal number. For example, assuming a square 
and a line segment both completely filled with mass points (so that at each 
point of them exactly one mass point is situated), it follows, owing to the 
demonstrable fact that there exists a one-to-one correspondence between 
the points of a square and of a fine segment, and, therefore, also between 
the corresponding mass points, that the mass points of the square can be 
so rearranged as exactly to fill out the line segment, and vice versa. Such 
considerations, it is true, apply directly only to physical objects, but a 
definition of the concept of “number” which would depend on the kind of 
objects that are numbered could hardly be considered as satisfactory. 

So there is hardly any choice left but to accept Cantor’s definition of 
equality between numbers, which can easily be extended to a definition 
of “greater” and “less” for infinite numbers by stipulating that the cardi¬ 
nal number M of a set A is to be called less than the cardinal number 
TV of a set B if M is different from N but equal to the cardinal number 
of some subset of B. On the basis of these definitions it becomes possible to 
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prove that there exist infinitely many different infinite cardinal numbers or 
“powers”, and that, in particular, the number of subsets of a set is always 
greater than the number of its elements; furthermore, it becomes possible 
to extend (again without any arbitrariness) the arithmetical operations to 
infinite numbers (including sums and products with any infinite | number of 516 
terms or factors) and to prove practically all ordinary rules of computation. 

But, even after that, the problem to determine the cardinal number of 
an individual set, such as the linear continuum, would not be well-defined 
if there did not exist some “natural” representation of the infinite cardinal 
numbers, comparable to the decimal or some other systematic denotation 
of the integers. This systematic representation, however, does exist, owing 
to the theorem that for each cardinal number and each set of cardinal 
numbers 1 there exists exactly one cardinal number immediately succeeding 
in magnitude and that the cardinal number of every set occurs in the series 
thus obtained. 2 This theorem makes it possible to denote the cardinal 
number immediately succeeding the set of finite numbers by Mo (which is 
the power of the “denumerably infinite” sets), the next one by Mi, etc.; the 
one immediately succeeding all Mj (where i is an integer), by M w , the next 
one by M^+i, etc., and the theory of ordinal numbers furnishes the means 
to extend this series farther and farther. 


2. The continuum problem, the continuum hypothesis 
and the partial results concerning its truth 
obtained so far 

So the analysis of the phrase “how many” leads unambiguously to quite 
a definite meaning for the question stated in the second line of this paper, 
namely, to find out which one of the M’s is the number of points on a 
straight line or (which is the same) on any other continuum in Euclidean 
space. Cantor, after having proved that this number is certainly greater 
than Mo, conjectured that it is Mi, or (which is an equivalent proposition) 
that every infinite subset of the continuum has either the power of the set of 
integers or of the whole continuum. This is Cantor’s continuum hypothesis. 


1 As to the question why there does not exist a set of all cardinal numbers, see 
footnote 14. 

2 In order to prove this theorem the axiom of choice (see Fraenkel 1928, p. 288 ff.) 
is necessary, but it may be said that this axiom is, in the present state of knowledge, 
exactly as well-founded as the system of the other axioms. It has been proved consistent, 
provided the other axioms are so (see Godel 1940). It is exactly as evident as the other 
axioms for sets in the sense of arbitrary multitudes and, as for sets in the sense of ex¬ 
tensions of definable properties; it also is demonstrable for those concepts of definability 
for which, in the present state of knowledge, it is possible to prove the other axioms, 
namely, those explained in footnotes 20 and 26. 
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But, although Cantor’s set theory has now had a development of more 
than sixty years and the problem is evidently of great importance for it, 
nothing has been proved so far relative to the question what the power of 
the continuum is or whether its subsets satisfy the condition just stated, 
except (1) that the power of the continuum is not a cardinal number of a 
certain very special kind, namely, not a limit of denumerably many smaller 
cardinal numbers, 3 and (2) that the proposition just mentioned about the 
517 subsets of the continuum is | true for a certain infinitesimal fraction of these 
subsets, the analytical 4 sets. 5 Not even an upper bound, however high, can 
be assigned for the power of the continuum. Nor is there any more known 
about the quality than about the quantity of the cardinal number of the 
continuum. It is undecided whether this number is regular or singular, 
accessible or inaccessible, and (except for Konig’s negative result) what its 
character of cofinality 4 is. The only thing one knows, in addition to the 
results just mentioned, is a great number of consequences of, and some 
propositions equivalent to, Cantor’s conjecture. 6 

This pronounced failure becomes still more striking if the problem is 
considered in its connection with general questions of cardinal arithmetic. 
It is easily proved that the power of the continuum is equal to 2 H °. So the 
continuum problem turns out to be a question from the “multiplication ta¬ 
ble” of cardinal numbers, namely, the problem to evaluate a certain infinite 
product (in fact the simplest non-trivial one that can be formed). There is, 
however, not one infinite product (of factors > 1) for which only as much 
as an upper bound for its value can be assigned. All one knows about the 
evaluation of infinite products are two lower bounds due to Cantor and 
Konig (the latter of which implies a generalization of the aforementioned 
negative theorem on the power of the continuum), and some theorems con¬ 
cerning the reduction of products with different factors to exponentiations 
and of exponentiations to exponentiations with smaller bases or exponents. 
These theorems reduce 7 the whole problem of computing infinite products 
to the evaluation of and the performance of certain fundamental 

operations on ordinal numbers, such as determining the limit of a series 
of them. and therewith all products and powers, can easily be 


3 See Hausdorj'f 1914 , P- 68. The discoverer of this theorem, J. Konig, asserted more 
than he had actually proved (see his 1905). 

4 See the list of definitions at the end of this paper. 

5 See Hausdorff 1935, p. 32. Even for complements of analytical sets the question 
is undecided at present, and it can be proved only that they have (if they are infinite) 
either the power Ho or Hi or that of the continuum (see Kuratowski 1933 , p. 246). 

6 See Sierpinski 1934- 

7 This reduction can be effected owing to the results and methods of Tarski 1925. 
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computed 8 if the “generalized continuum hypothesis” is assumed, i.e., if 
it is assumed that 2 N " = M a+ i for every a , or, in other terms, that the 
number of subsets of a set of power M a is M„ +1 . But, without making any 
undemonstrated assumption, it is not even known whether or not m < n 
implies 2 m < 2™ (although it is trivial that it implies 2 TO < 2"), nor even 
whether 2 N ° < 2** 1 . 


3. Restatement of the problem on the basis 
of an analysis of the foundations of set theory 
and results obtained along these lines 

This scarcity of results, even as to the most fundamental questions in 
this field, may be due to some extent to purely mathematical difficulties; 
it seems, however (see Section 4 below), that there are also deeper reasons 
behind it and that a complete solution of | these problems can be obtained 518 
only by a more profound analysis (than mathematics is accustomed to give) 
of the meanings of the terms occurring in them (such as “set”, “one-to-one 
correspondence”, etc.) and of the axioms underlying their use. Several 
such analyses have been proposed already. Let us see then what they give 
for our problem. 

First of all there is Brouwer’s intuitionism, which is utterly destructive 
in its results. The whole theory of the M’s greater than Mi is rejected as 
meaningless. 9 Cantor’s conjecture itself receives several different mean¬ 
ings, all of which, though very interesting in themselves, are quite different 
from the original problem, and which lead partly to affirmative, partly to 
negative answers; 10 not everything in this field, however, has been clari¬ 
fied sufficiently. The “half-intuitionistic” standpoint along the lines of H. 
Poincare and H. Weyl 11 would hardly preserve substantially more of set 
theory. 

This negative attitude towards Cantor’s set theory, however, is by no 
means a necessary outcome of a closer examination of its foundations, but 
only the result of certain philosophical conceptions of the nature of mathe¬ 
matics, which admit mathematical objects only to the extent in which they 


8 For regular numbers one obtains immediately: 

N cf(n„) _ = 2 u a _ 

9 See Brouwer 1909. 

10 See Brouwer 1907, I, 9; III, 2. 

11 See Weyl 1932. If the procedure of construction of sets described there (p. 20) 
is iterated a sufficiently large (transfinite) number of times, one gets exactly the real 
numbers of the model for set theory spoken of below in Section 4, in which the continuum 
hypothesis is true. But this iteration would hardly be possible within the limits of the 
half-intuitionistic standpoint. 
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are (or are believed to be) interpretable as acts and constructions of our 
own mind, or at least completely penetrable by our intuition. For some¬ 
one who does not share these views there exists a satisfactory foundation 
of Cantor’s set theory in its whole original extent, namely, axiomatics of 
set theory, under which the logical system of Principia mathematica (in a 
suitable interpretation) may be subsumed. 

It might at first seem that the set-theoretical paradoxes would stand in 
the way of such an undertaking, but closer examination shows that they 
cause no trouble at all. They are a very serious problem, but not for 
Cantor’s set theory. As far as sets occur and are necessary in mathematics 
(at least in the mathematics of today, including all of Cantor’s set theory), 
they are sets of integers, or of rational numbers (i.e., of pairs of integers), 
or of real numbers (i.e., of sets of rational numbers), or of functions of real 
numbers (i.e., of sets of pairs of real numbers), etc.; when theorems about 
all sets (or the existence of sets) in general are asserted, they can always 
be interpreted without any difficulty to mean that they hold for sets of 
integers as well as for sets of real numbers, etc. (respectively, that there 
exist either sets of integers, or sets of real numbers, or ... etc., which have 
the asserted property). This concept of set, however, according to which 
a set is anything obtainable from the integers (or some other well-defined 
519 | objects) by iterated application 12 of the operation “set of”, 13 and not 

something obtained by dividing the totality of all existing things into two 
categories, has never led to any antinomy whatsoever; that is, the perfectly 
“naive” and uncritical working with this concept of set has so far proved 
completely self-consistent. 14 

But, furthermore, the axioms underlying the unrestricted use of this 
concept of set, or, at least, a portion of them which suffices for all mathe¬ 
matical proofs ever produced up to now, have been so precisely formulated 
in axiomatic set theory 15 that the question whether some given proposition 
follows from them can be transformed, by means of logistic symbolism, into 


12 This phrase is to be understood so as to include also transfinite iteration, the 
totality of sets obtained by finite iteration forming again a set and a basis for a further 
application of the operation “set of”. 

13 The operation “set of x’s” cannot be defined satisfactorily (at least in the present 
state of knowledge), but only be paraphrased by other expressions involving again the 
concept of set, such as: “multitude of a:’s", “combination of any number of x's". "part 
of the totality of x’s”; but as opposed to the concept of set in general (if considered as 
primitive) we have a clear notion of this operation. 

14 It follows at once from this explanation of the term “set” that a set of all sets 
or other sets of a similar extension cannot exist, since every set obtained in this way 
immediately gives rise to further application of the operation “set of” and, therefore, to 
the existence of larger sets. 

15 See, e.g., Bemays 1937, 1941, 1942, 1942a, 1943, von Neumann 1925: cf. also von 
Neumann 1928a and 1929, Godel 1940. 
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a purely combinatorial problem concerning the manipulation of symbols 
which even the most radical intuitionist must acknowledge as meaningful. 

So Cantor’s continuum problem, no matter what philosophical standpoint 
one takes, undeniably retains at least this meaning: to ascertain whether 
an answer, and if so what answer, can be derived from the axioms of set 
theory as formulated in the systems quoted. 

Of course, if it is interpreted in this way, there are (assuming the con¬ 
sistency of the axioms) a priori three possibilities for Cantor’s conjecture: 

It may be either demonstrable or disprovable or undecidable. 16 The third 
alternative (which is only a precise formulation of the conjecture stated 
above that the difficulties of the problem are perhaps not purely mathe¬ 
matical) is the most likely, and to seek a proof for it is at present one of 
the most promising ways of attacking the problem. One result along these 
lines has been obtained already, namely, that Cantor’s conjecture is not 
disprovable from the axioms of set theory, provided that these axioms are 
consistent (see Section 4). 

It is to be noted, however, that, even if one should succeed in proving its 
undemonstrability as well, this would (in contradistinction, for example, 
to the proof for the transcendency of n) by no means settle the question 
definitively. | Only someone who (like the intuitionist) denies that the 520 
concepts and axioms of classical set theory have any meaning (or any well- 
defined meaning) could be satisfied with such a solution, not someone who 
believes them to describe some well-determined reality. For in this reality 
Cantor’s conjecture must be either true or false, and its undecidability 
from the axioms as known today can only mean that these axioms do not 
contain a complete description of this reality; and such a belief is by no 
means chimerical, since it is possible to point out ways in which a decision 
of the question, even if it is undecidable from the axioms in their present 
form, might nevertheless be obtained. 

For first of all the axioms of set theory by no means form a system closed 
in itself, but, quite on the contrary, the very concept of set 17 on which they 
are based suggests their extension by new axioms which assert the existence 
of still further iterations of the operation “set of’. These axioms can also 
be formulated as propositions asserting the existence of very great cardinal 
numbers or (which is the same) of sets having these cardinal numbers. 


16 In case of the inconsistency of the axioms the last one of the four a priori pos¬ 
sible alternatives for Cantor’s conjecture would occur, namely, it would then be both 
demonstrable and disprovable by the axioms of set theory. 

17 Similarly also the concept “property of set” (the second of the primitive terms 
of set theory) can constantly be enlarged and, furthermore, concepts of “property of 
property of set” etc. be introduced whereby new axioms are obtained, which, however, 
as to their consequences for propositions referring to limited domains of sets (such as 
the continuum hypothesis) are contained in the axioms depending on the concept of set. 
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The simplest of these strong “axioms of infinity” assert the existence of 
inaccessible numbers (and of numbers inaccessible in the stronger sense) 
> No- The latter axiom, roughly speaking, means nothing else but that the 
totality of sets obtainable by exclusive use of the processes of formation 
of sets expressed in the other axioms forms again a set (and, therefore, a 
new basis for a further application of these processes). 18 Other axioms of 
infinity have been formulated by P. Mahlo. 19 Very little is known about 
this section of set theory; but at any rate these axioms show clearly, not 
only that the axiomatic system of set theory as known today is incomplete, 
but also that it can be supplemented without arbitrariness by new axioms 
which are only the natural continuation of the series of those set up so far. 

That these axioms have consequences also far outside the domain of very 
great transfinite numbers, which are their immediate object, can be proved; 
each of them (as far as they are known) can, under the assumption of consis¬ 
tency, be shown to increase the number of decidable propositions even in the 
field of Diophantine equations. As for the continuum problem, there is little 
hope of solving it by means of those axioms of infinity which can be set up 
on the basis of principles known today (the above-mentioned proof for the 
undisprovability of the continuum hypothesis, e.g., goes through for all 
of them without any change). But probably there exist others based on 
hitherto unknown principles; also there may exist, besides the ordinary 
521 axioms, the axioms of infinity and | the axioms mentioned in footnote 17, 
other (hitherto unknown) axioms of set theory which a more profound un¬ 
derstanding of the concepts underlying logic and mathematics would enable 
us to recognize as implied by these concepts. 

Furthermore, however, even disregarding the intrinsic necessity of some 
new axiom, and even in case it had no intrinsic necessity at all, a decision 
about its truth is possible also in another way, namely, inductively by study¬ 
ing its “success”, that is, its fruitfulness in consequences and in particular 
in “verifiable” consequences, i.e., consequences demonstrable without the 
new axiom, whose proofs by means of the new axiom, however, are consid¬ 
erably simpler and easier to discover, and make it possible to condense into 
one proof many different proofs. The axioms for the system of real num¬ 
bers, rejected by the intuitionists, have in this sense been verified to some 
extent owing to the fact that analytical number theory frequently allows 
us to prove number-theoretical theorems which can subsequently be ver¬ 
ified by elementary methods. A much higher degree of verification than 
that, however, is conceivable. There might exist axioms so abundant in 


18 See Zermelo 1930. 

19 See his 1911, pp. 190-200, 1913, pp. 269-276. From Mahlo’s presentation of the 
subject, however, it does not appear that the numbers he defines actually exist. 
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their verifiable consequences, shedding so much light upon a whole dis¬ 
cipline, and furnishing such powerful methods for solving given problems 
(and even solving them, as far as that is possible, in a constructivistic way) 
that quite irrespective of their intrinsic necessity they would have to be 
assumed at least in the same sense as any well-established physical theory. 


4. Some observations about the question: 

In what sense and in which direction may a solution 
of the continuum problem be expected? 

But are such considerations appropriate for the continuum problem? Are 
there really any strong indications for its unsolubility by the known axioms? 

I think there are at least two. 

The first one is furnished by the fact that there are two quite differently 
defined classes of objects which both satisfy all axioms of set theory written 
down so far. One class consists of the sets definable in a certain manner by 
properties of their elements, 20 the other of the sets in the sense of arbitrary 
multitudes irrespective of if, or how, they can be defined. Now, before it 
is settled what objects are to be numbered, and on the basis of what one- 
to-one correspondences, one could hardly expect to be able to determine 
their number (except perhaps in case of some fortunate coincidence). If, 
however, someone believes that it is meaningless to speak of sets except in 
the sense of extensions of definable properties, or, at least, that no other 
sets exist, then, too, he can hardly expect more than a small fraction of the 
problems of set theory to be solvable without making use of this, in his opin¬ 
ion essential, characteristic of sets, namely, that they are | all derived from 522 
(or in a sense even identical with) definable properties. This characteristic 
of sets, however, is neither formulated explicitly nor contained implicitly 
in the accepted axioms of set theory. So from either point of view, if in 
addition one has regard to what was said above in Section 2, it is plau¬ 
sible that the continuum problem will not be solvable by the axioms set 
up so far, but, on the other hand, may be solvable by means of a new axiom 


20 Namely, definable “in terms of ordinal numbers” (i.e., roughly speaking, under 
the assumption that for each ordinal number a symbol denoting it is given) by means 
of transfinite recursions, the primitive terms of logic, and the e-relation, admitting, 
however, as elements of sets and of ranges of quantifiers only previously defined sets. 
See my papers 1939a and 1940, where an exactly equivalent, although in its definition 
slightly different, concept of definability (under the name of “constructibility”) is used. 
The paradox of Richard, of course, does not apply to this kind of definability, since the 
totality of ordinals is certainly not denumerable. 
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which would state or at least imply something about the definability of 
sets . 21 

The latter half of this conjecture has already been verified; namely, the 
concept of definability just mentioned (which is itself definable in terms of 
the primitive notions of set theory) makes it possible to derive the general¬ 
ized continuum hypothesis from the axiom that every set is definable in this 
sense . 22 Since this axiom (let us call it “A”) turns out to be demonstrably 
consistent with the other axioms, under the assumption of the consistency 
of these axioms, this result (irrespective of any philosophical opinion) shows 
the consistency of the continuum hypothesis with the axioms of set theory, 
provided that these axioms themselves are consistent . 23 This proof in its 
structure is analogous to the consistency proof for non-Euclidean geome¬ 
try by means of a model within Euclidean geometry, insofar as it follows 
from the axioms of set theory that the sets definable in the above sense 
form a model for set theory in which furthermore the proposition A and, 
therefore, the generalized continuum hypothesis is true. But the definition 
of “definability” can also be so formulated that it becomes a definition of 
a concept of “set” and a relation of “element of” (satisfying the axioms of 
set theory) in terms of entirely different concepts, namely, the concept of 
“ordinal numbers”, in the sense of elements ordered by some relation of 
“greater” and “less”, this ordering relation itself, and the notion of “recur¬ 
sively defined function of ordinals”, which can be taken as primitive and be 
described axiomatically by way of an extension of Peano’s axioms . 24 (Note 
that this does not apply to my original formulation presented in the papers 
quoted above, because there the general concept of “set” with its element 
relation occurs in the definition of “definable set”, although the definable 
sets remain the same if, afterwards, in the definition of “definability” the 
term “set” is replaced by “definable set”.) 

523 | A second argument in favor of the unsolubility of the continuum problem 

on the basis of the ordinary axioms can be based on certain facts (not known 
or not existing at Cantor’s time) which seem to indicate that Cantor’s 


21 D. Hilbert’s attempt at a solution of the continuum problem (see his 1926), which, 
however, has never been carried through, also was based on a consideration of all possible 
definitions of real numbers. 

22 On the other hand, from an axiom in some sense directly opposite to this one the 
negation of Cantor’s conjecture could perhaps be derived. 

23 See my paper 1940 and note 1939a. For a carrying through of the proof in all 
details, my paper 1940 is to be consulted. 

24 For such an extension see Tarski 1924, where, however, the general concept of “set 
of ordinal numbers” is used in the axioms; this could be avoided, without any loss in 
demonstrable theorems, by confining oneself from the beginning to recursively definable 
sets of ordinals. 
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conjecture will turn out to be wrong; 25 for a negative decision the question 
is (as just explained) demonstrably impossible on the basis of the axioms 
as known today. 

There exists a considerable number of facts of this kind which, of course, 
at the same time make it likely that not all sets are definable in the above 
sense. 26 One such fact, for example, is the existence of certain properties of 
point sets (asserting an extreme rareness of the sets concerned) for which 
one has succeeded in proving the existence of undenumerable sets having 
these properties, but no way is apparent by means of which one could 
expect to prove the existence of examples of the power of the continuum. 
Properties of this type (of subsets of a straight line) are: (1) being of 
the first category on every perfect set, 27 (2) being carried into a zero set 
by every continuous one-to-one mapping of the line on itself. 28 Another 
property of a similar nature is that of being coverable by infinitely many 
intervals of any given lengths. But in this latter case one has so far not 
even succeeded in proving the existence of undenumerable examples. From 
the continuum hypothesis, however, it follows that there exist in all three 
cases not only examples of the power of the continuum, 29 but even such 
as are carried into themselves (up to denumerably many points) by every 
translation of the straight line. 30 

And this is not the only paradoxical consequence of the continuum 
hypothesis. Others, for example, are that there exist: (1) subsets of a 
straight line of the power of the continuum which are covered (up to denu¬ 
merably many points) by every dense set of intervals, or (in other terms) 
which contain no undenumerable subset nowhere dense on the 
straight line, 31 (2) subsets of a straight line of the power of the contin¬ 
uum which contain no undenumerable zero set, 32 (3) subsets of Hilbert 


25 Views tending in this direction have been expressed also by N. Luzin in his 1935, 
p. 129 ff. See also Sierpinski 1935. 

26 That all sets are “definable in terms of ordinals” if all procedures of definition, i.e., 
also quantification and the operation x with respect to all sets, irrespective of whether 
they have or can be defined, are admitted could be expected with more reason, but still 
it would not at all be justified to assume this as an axiom. It is worth noting that the 
proof that the continuum hypothesis holds for the definable sets, or follows from the 
assumption that all sets are definable, does not go through for this kind of definability, 
although the assumption that these two concepts of definability are equivalent is, of 
course, demonstrably consistent with the axioms. 

27 See Sierpinski 1934a and Kuratowski 1933, p. 269 ff. 

28 See Luzin and Sierpinski 1918 and Sierpinski 1934a. 

29 For the 3rd case see Sierpinski 1934, P- 39, Theorem 1. 

30 See Sierpinski 1935a. 

31 See Luzin 1914, P- 1259. 

32 See Sierpinski 1924 , p- 184. 
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space of the power of the continuum which contain no undenumerable sub¬ 
set of finite dimension, 33 (4) an infinite sequence A 1 of decompositions of 
524 any set M of the power of the continuum into continuum j many mutually 
exclusive sets A* x such that, in whichever way a set A x is chosen for each i. 
II(M — A®.) is always denumerable. 34 Even if in (l)-(4) “power of the 
continuum” is replaced by “Mi”, these propositions are very implausible; 
the proposition obtained from (3) in this way is even equivalent with (3). 

One may say that many of the results of point-set theory obtained 
without using the continuum hypothesis also are highly unexpected and 
implausible. 35 But, true as that may be, still the situation is different 
there, insofar as in those instances (such as, e.g., Peano’s curves) the ap¬ 
pearance to the contrary can in general be explained by a lack of agreement 
between our intuitive geometrical concepts and the set-theoretical ones oc¬ 
curring in the theorems. Also, it is very suspicious that, as against the 
numerous plausible propositions which imply the negation of the contin¬ 
uum hypothesis, not one plausible proposition is known which would imply 
the continuum hypothesis. Therefore one may on good reason suspect that 
the role of the continuum problem in set theory will be this, that it will 
finally lead to the discovery of new axioms which will make it possible to 
disprove Cantor’s conjecture. 


Definitions of some of the technical terms 

Definitions 4-12 refer to subsets of a straight line, but can be literally 
transferred to subsets of Euclidean spaces of any number of dimensions; 
definitions 13-14 refer to subsets of Euclidean spaces. 

1. I call “character of cofinality” of a cardinal number m (abbreviated by 
“cf(m)”) the smallest number n such that m is the sum of n numbers 

< m. 

2. A cardinal number m is regular if cf(m) = m, otherwise singular. 

3. An infinite cardinal number m is inaccessible if it is regular and has 
no immediate predecessor (i.e., if, although it is a limit of numbers 

< m, it is not a limit of fewer than m such numbers); it is inaccessible 
in the stronger sense if each product (and, therefore, also each sum) of 
fewer than m numbers < m is < m. (See Sierpiriski and Tarski 1930 , 
Tarski 1938. From the generalized continuum hypothesis follows the 
equivalence of these two notions. This equivalence, however, is a 


33 See Hure/uncz 1932. 

34 See Braun and Sierpinski 1932 , p. 1, proposition (Q). This proposition and the 
me stated under (3) in the text are equivalent with the continuum hypothesis. 

35 See, e.g., Blumenthal 1940. 




Cantor’s continuum problem 


187 


much weaker and much more plausible proposition. No evidently is 
inaccessible in both senses. As for finite numbers, 0 and 2 and no 
others are inaccessible in the stronger sense (by the above definition), 
which suggests that the same will hold also for the correct extension 
of the concept of inaccessibility to finite numbers.) 

4. A set of intervals is dense if every interval has points in common 
with some interval of the set. (The endpoints of an interval are not 
considered as points of the interval.) 

5. A zero set is a set which can be covered by infinite sets of intervals 
with arbitrarily small lengths-sum. 

6. A neighborhood of a point P is an interval containing P. 

7. A subset A of B is dense in B if every neighborhood of any point of 
B contains points of A. 

8. A point P is in the exterior of A if it has a neighborhood containing 
no point of A. 

9. A subset A of B is nowhere dense on B if those points of B which 
are in the exterior of A are dense in B. (Such sets A are exactly the 
subsets of the borders of the open sets in B, but the term “border-set” 
is unfortunately used in a different sense.) 

10. A subset A of B is of the first category in B if it is the sum of 
denumerably many sets nowhere dense in B. 

| 11. Set A is of the first category on B if the intersection A ■ B is of the 525 
first category in B. 

12. A set is perfect if it is closed and has no isolated point (i.e., no point 
with a neighborhood containing no other point of the set). 

13. Borel sets are defined as the smallest system of sets satisfying the 
postulates: 

(1) The closed sets are Borel sets. 

(2) The complement of a Borel set is a Borel set. 

(3) The sum of denumerably many Borel sets is a Borel set. 

14. A set is analytic if it is the orthogonal projection of some Borel set 
of a space of next higher dimension. (Every Borel set therefore is, of 
course, analytic.) 

15. Quantifiers are the logistic symbols standing for the phrases: “for all 
objects x” and “there exist objects x”. The totality of objects x to 
which they refer is called their range. 

16. The symbol “x” means “the set of those objects x for which ...”. 
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Introductory note to 1949 
and 1952 


These two papers represent Godel’s main contribution to relativis¬ 
tic cosmology. In the 1920s and the 1930s, the Friedmann-Robertson- 
Walker cosmological models had been introduced as the simplest solu¬ 
tions of the equations of Einstein’s general theory of relativity that were 
consistent with the observed red-shift of distant galaxies. These models 
were spatially homogeneous and isotropic, and were expanding but were 
non-rotating. Godel was the first to consider models that were rotating. 
The possible rotation of the universe has a special significance in general 
relativity because one of the influences that led Einstein to the theory 
in 1915 was Mach’s principle. The exact formulation of the principle is 
rather obscure, but it is generally interpreted as denying the existence of 
absolute space. In other words, matter has inertia only relative to other 
matter in the universe. The principle is generally taken to imply that 
the local inertial frame defined by gyroscopes should be non-rotating 
with respect to the frame defined by distant galaxies. 

Godel showed that it was possible to have solutions of the Einstein 
field equations in which the galaxies were rotating with respect to the lo¬ 
cal inertial frame. He therefore demonstrated that general relativity does 
not incorporate Mach’s principle. Whether or not this is an argument 
against general relativity depends on your philosophical viewpoint, but 
most physicists nowadays would not accept Mach’s principle, because 
they feel that it makes an untenable distinction between the geometry 
of space-time, which represents the gravitational and inertial field, and 
other forms of fields and matter. 

In the first of these papers (1949) Godel presented a rotating solu¬ 
tion that was not expanding but was the same at all points of space and 
time. This solution was the first to be discovered that had the curious 
property that in it it was possible to travel into the past. This leads to 
paradoxes such as “What happens if you go back and kill your father 
when he was a baby?” It is generally agreed that this cannot happen 
in a solution that represents our universe, but Godel was the first to 
show that it was not forbidden by the Einstein equations. His solution 
generated a lot of discussion of the relation between general relativity 
and the concept of causality. 

The second paper (1952) describes more reasonable rotating cosmo¬ 
logical models that are expanding and that do not have the possibil¬ 
ity of travel into the past. These models could well be a reasonable 
description of the universe that we observe, although observations of the 
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isotropy of the microwave background indicate that the rate of rotation 
must be very low. 


S. W. Hawking 


[For a more detailed discussion of Godel’s cosmological models, 
the reader is referred to Hawking and Ellis 1973, pages 168-170, to 
Malament 1985, and to Raychaudhuri 1979, pages 92-95.] 


An example of a new type 
of cosmological solutions 
of Einstein’s field equations of gravitation 
( 1949 ) 


1. The main properties of the new solution 

All cosmological solutions with non-vanishing density of matter known 
at present 1 have the common property that, in a certain sense, they contain 
an “absolute” time coordinate, 2 owing to the fact that there exists a one- 
parametric system of three-spaces everywhere orthogonal on the world lines 
of matter. It is easily seen that the non-existence of such a system of three- 
spaces is equivalent with a rotation of matter relative to the compass of 
inertia. In this paper I am proposing a solution (with a cosmological term 
^ 0) which exhibits such a rotation. This solution, or rather the four¬ 
dimensional space S which it defines, has the further properties: 

(1) S is homogeneous, i.e., for any two points P, Q of S there exists a 
transformation of S into itself which carries P into Q. In terms of physics 
this means that the solution is stationary and spatially homogeneous. 

(2) There exists a one-parametric group of transformations of S into 
itself which carries each world line of matter into itself, so that any two 
world lines of matter are equidistant. 


1 See, for example, Robertson 1933. 

2 As to the philosophical consequences which have been drawn from this circumstance 
see Jeans 1936 and my article 1949a, forthcoming in the Einstein volume of the Library 
of Living Philosophers. 
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(3) S has rotational symmetry, i.e., for each point P of S there exists 
a one-parametric group of transformations of S into itself which carries P 
into itself. 

(4) The totality of time-like and null vectors can be divided into + - and 
- -vectors in such a way that: (a) if £ is a +-vector, — £ is a — -vector, (b) 
a limit of + - (or - -)vectors, if ^ 0, is again a + - (or - -)vector. That 
is, a positive direction of time can consistently be introduced in the whole 
solution. 

After a direction of time has been introduced in this way, a temporal 
orientation is defined for the world line of every (real or possible) particle 
of matter or light, i.e., it is determined for any two neighboring points on 
it which one is earlier. On the other hand, however, no uniform temporal 
ordering of all point events, agreeing in direction with all these individual 
orderings, exists. This is expressed in the next property: 

(5) It is not possible to assign a time coordinate t to each space-time 
point in such a way that t always increases, if one moves in a positive time¬ 
like direction; and this holds both for an open and a closed time coordinate. 

(6) Every world line of matter occurring in the solution is an open line 
of infinite length, which never approaches any of its preceding points again; 
but there also exist closed time-like lines. 3 In particular, if P, Q are any 
two points on a world line of matter, 4 and P precedes Q on this line, there 
exists a time-like line connecting P and Q on which Q precedes P; i.e., it 
is theoretically possible in these worlds to travel into the past, or otherwise 
influence the past. 

(7) There exist no three-spaces which are everywhere space-like and 
intersect each world line of matter in one point. 

(8) If E is any system of mutually exclusive three-spaces, each of which 
intersects every world line of matter in one point, 5 then there exists a' 
transformation which carries S and the positive direction of time into itself, 
but does not carry E into itself; i.e., an absolute time does not exist, even if 
it is not required to agree in direction with the times of all possible observers 
(where “absolute” means: definable without reference to individual objects, 
such as, e.g., a particular galactic system). 

(9) Matter everywhere rotates relative to the compass of inertia with 
the angular velocity 2(irKp)z , where p is the mean density of matter and k 
Newton’s gravitational constant. 


3 If the tangent of a line is discontinuous, the line is to be considered as time-like only 
if the corners can be so rounded off that the resulting line is everywhere time-like. 

4 “World line of matter” without further specification always refers to the world lines 
of matter occurring as such in the solution under consideration. 

5 Another hypothesis about E under which the conclusion holds is that E is one- 
parametric and oriented (where the orientation refers to the space whose points are the 
elements of E). 
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2. Definition of the linear element 
and proof that it satisfies the field equations 


The linear element of S is defined by the following expression: 6 

a 2 {dx o 2 - dxx 2 + (e 2xi /2 )dx 2 2 ~ dx z 2 + 2 e Xl dx 0 dx 2 ), 

448 | where a is a positive number. The matrices of the and g lk , therefore, 

are the two: 


1 

0 e Xl 

0 


-1 

0 2e- x i 

0 

0 

-1 0 

0 

1 

0 

1 0 

0 

e Xl 

0 e 2xi /2 

0 

’ a 2 ' 

2e _Xl 

0 —2e -2xi 

0 

0 

0 0 

-1 


0 

0 0 

-1 


Owing to the fact that only two of the forty dgnjdxi are ^ 0, namely 
dg22ldxi and Ogo2/Oxi, the and can very easily be computed. 
One obtains the values: 

To,12 = -Id,02 = r 2)0 i = (a 2 /2)e Xl , 
ri , 22 = -r 2 ,i 2 = -(a 2 / 2 )e 2xi , 

I’m = 1, r ?2 = rj 2 = e Xl /2, 

r 22 = e 2xi /2, = -e Xl . 


These Ti^i and T‘ kl , and those obtained from them by interchanging the 
last two (or the lower two) indices are the only ones that do not vanish. 
Using for Rik the formula 7 


Rik = 


I d 2 log g 1 dlogg 

2 OxiOx). 2 tk Ox a 


6 This quadratic form can also be written thus 

a 2 jfcfcco + eXl dx 2 ) 2 — dx 1 2 — ^Y~dx 2 2 — dx 3 2 j , 
which makes it evident that, as required, its signature is everywhere —2. The three- 
space obtained by leaving out the term —dxz 2 has a simple geometric meaning (see 
448 below). Essentially the | same three-space, but with the signature +3 and with more 
general values of the constants, has been investigated in connection with the theory of 
continuous groups, without any reference to relativity theory. See, for example, Bianchi 
1918, p. 565. 

"Note that physicists frequently denote with —Rik what is here denoted with Rik, 
with a corresponding change of sign in the field equations. 
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and taking account of the fact that d/dot .except for i = 1, vanishes for 
every magnitude of the solution, and that g — (a 8 / 2)e 2xi , we obtain 


Rik = 


r rj k + rl k -r^ k . 


This yields the values for the R ik 

Roo = 1 ) R22 — e 2xi , R02 = R20 — e Xl ! 


all other Rik vanish. Hence, 


R = 1/a 2 . 

The unit vector u in the direction of the Xo-lines has the contravariant 
components 1/a, 0, 0, 0 and, therefore, the covariant components a, 0, 
ae Xl , 0. 

Hence we obtain: 

R ik = 1/a 2 • UiU k . 

Since, furthermore, R is a constant, the relativistic field equations (with 
the :ro-lines as world lines of matter), i.e., the equations 8 

Rik ~ \ 9 ikR = 87 tKpuiUk + Xg ik 

are satisfied (for a given value of p ) if we put 

1/a 2 = 8-itkp, A = -R/2 = —1/2 a 2 = —47r up. 

The sign of the cosmological constant here is the opposite of that occurring 
in Einstein’s static solution. It corresponds to a positive pressure. 


3. Proofs for the properties enumerated 

That there exists no one-parametric system of three-spaces orthogonal 
on the .x'oTines follows immediately from the necessary and sufficient con¬ 
dition which a vector field v in a four-space must satisfy, if there is to exist 


s The linear element is supposed to give time-like distances in seconds and space-like 
distances in light seconds. Therefore, the coefficient of UiU k differs from the usual one 
by a factor c 2 . 
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a system of three-spaces everywhere orthogonal on the vectors of the field. 
This condition requires that the skew symmetric tensor 

(dvk dvi \ (dvi dvi \ (dvi dvk \ 

° M V ‘ V<9xj dxk ) +Vk dxi) +Vl dx k dx t ) 

should vanish identically. The components of the corresponding vector 
e 3*kl 

w 3 = -— ~a ik i, 

6 - y/g 

however, in our case (i.e., for Vi = iq) have the values 0, 0, 0, \/2/o 2 . 
The non-vanishing of w 3 shows, moreover, that there exist no surfaces 
orthogonal on the Xo-lines in the subspaces X3 = constant. 

If v is the unit vector representing the velocity of matter, the vector w 
(which evidently is always orthogonal to v) is twice the angular velocity of 
matter in a local inertial system in whose origin matter is at rest at the 
moment considered. 9 Hence, property (9) follows at once. 

The properties (1) and (2) follow from the directly verifiable fact that 
the space S admits the following four systems of transformations into itself, 


(I) 

II II 

' + b 

for i ^ 0 

(II) 

#2 = x 2 + b 

Xi = x^ for i ^ 2 

(III) 

X 3 = X 3 ‘ 
Xi = Xi 

+ b 

for i ^ 3 

(IV) 

xi = xi ' + b 
x 2 = x 2 'e~ b 
%o = x 0 ' 
x 3 = x 3 ', 


where b is an arbitrary real number. 

A division of the time-like and null vectors into +- and —vectors as 
required by (4) can be effected by defining ( to be a +- or a —vector 
according as to whether the inner product (£u) = gikC uk is > or < 0. 

In order to prove (3) we introduce new coordinates r, <j), t, y (where r, 
ip, t are cylindrical coordinates in the subspaces X3 = constant, and y, up 
to a constant factor, is = X 3 ) by the following formulas of transformation, 
449 | which are easily solvable with respect to the Xi , 

e Xl = ch2r + cos (p sh2r 
x 2 e Xl = V2 sin ip sh2r 


9 This is an immediate consequence of the definition of a local inertial system, which 
requires that g t k = ±#fc l and dgik/8xi = 0 for every i, k, l. 
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(t x o - 2 A 
\2 ' 2V2 ) 


e 2 r tg-, where 

= 2 ». 


x 0 - 2t I 

2v^2 I 


This leads 10 to the expression for the linear element, 

4 a 2 (dt 2 — dr 2 — dy 2 4 - (sh 4 r — sh 2 r)d<j> 2 + 2 v / 2 sh 2 rd</>dt), 


which directly exhibits the rotational symmetry, since the < 7 ,*, do not depend 

Property ( 6 ) now follows easily: if c is defined by she = 1 (i.e., c = 
log(l + \/2 )), then for any JR > c we have sh 4 f?—sh 2 R > 0 ; hence, the circle 
defined by r = R, t = y = 0 is everywhere time-like (the positive direction 
of time, by the above definition, being that of increasing (f>). Hence, the 
line defined by 


r = R, y = 0, t = -a<j> (0 < <f> < 2ix) 

for sufficiently small a also will be everywhere time-like. However, the 
initial point Q of this line (i.e., the point corresponding to <j) = 0) and the 
end point P (i.e., that corresponding to 0 = 27r) are situated on the Mine: 
r = R, y = <j> = 0, and P precedes Q on this line if a > 0. Repeating this 
procedure, any point preceding Q on its Mine can be reached, and because 
of the homogeneity of the solution the same can be done for every point. 

Property (7), in view of ( 2 ) and (4), is an immediate consequence of ( 6 ). 
For, a three-space satisfying the two conditions stated in (7) in conjunc¬ 
tion with time measured along the world lines of matter in their positive 
direction would yield a coordinate system with the property that the 0 th 
coordinate always increases if one moves in a positive time-like direction, 
in contradiction to ( 6 ), which implies that all coordinates of the initial and 
the end point of a time-like fine are equal in certain cases. 

Property (5), for an open time coordinate, is an immediate consequence 
of the existence of closed time-like lines; for a closed time coordinate it 
follows from the fact that the subspaces t = constant would contradict 
property (7) (as can easily be shown owing to the simple connectivity of 
S). 

In order to prove property (8), let U be an element of E; then U intersects 
the subspace Sq of S defined by = 0 in a surface V (for it has one point 


10 This computation is rather cumbersome. It is simpler to derive both forms of the 
linear element independently from each other from the geometrical meaning of S given 
below. The first form is obtained by taking for the sia^-space of the coordinate system 
the point set corresponding to any two-parametric subgroup of the multiplicative group 
of the hyperbolic quaternions as defined in footnote 14. 
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in common with each x 0 -line situated on So). Now, according to what was 
proved, V cannot be orthogonal on all Zq- lines in So- So let / be an £o-line 
in So on which it is not orthogonal, and P the point of intersection of V 
and l. Then by rotating So around l (and every S& defined by x 0 = b 
by the same angle around the ^o-line obtained from l by the translation 
x' 3 = X3 + b), U goes over into a three-space different from U, but passing 
through P, hence not contained in E, since the elements of E were assumed 
to be mutually exclusive. Hence E goes over into a system different from E. 


4. Some additional theorems and considerations 
about the solution 

I am mentioning without proof that, disregarding the connectivity in 
the large (which can be changed by identifying the points of certain point 
sets with each other), the solution given and Einstein’s static universe are 
the only spatially homogeneous cosmological solutions with non-va nishing 
density of matter and equidistant world lines of matter. 11 

The space S has a simple geometric meaning. It is the direct product of 
a straight line and the three-space So, defined by x 3 = 0; and So is obtained 
from a space R of constant positive curvature and signature + — — by 
stretching the metric 12 in the ratio y/2:1 in the direction of a system of 
time-like Clifford parallels. 13 

This definition of So also leads to an elegant representation of its group 
of transformations. To this end we map the points of R on the hyperbolic 
quaternions uo 4- itiii + U2J2 + u 3j3 of positive absolute value 14 by means 


11 There exist stationary homogeneous solutions in which the world lines of matter 
are not equidistant. They lead, however, into difficulties in consequence of the inner 
friction which would arise in the “gas” whose molecules are the galaxies, unless the 
irregular motion of the galaxies is zero and stays so. 

12 By “stretching the metric in the ratio ft in the direction of the lines of a system 
7r” I mean that a new distance PQ' of neighboring points is introduced by the equation 
( PQ ') 2 = PR 2 + (fi ■ RQ) 2 , where R is the foot of the perpendicular drawn from P on 
the line of 7r passing through Q\ or in other terms: (ds') 2 = ds 2 + (p 2 — l)(ui dx,) 2 , 
where v is the field of the tangent vectors of unit length of the lines of 7r. 

13 That is, a system of pairwise equidistant straight lines which for each point of space 
contains exactly one line passing through it. 

14 Here the u, are real numbers and the units j n are defined by ji = ii, 32 = i ■ i2, 
jz = i-iz, where the i n are the units of the ordinary quaternions and i the imaginary unit, 
which is assumed to commute with all i n . The term “hyperbolic quaternions” occurs in 
the literature in a different sense, but the number system just defined evidently is what 
should be so called. For: norm(u) = u-u — uo 2 + ui 2 — 112 2 — U3 2 , and moreover, the 
multiplicative group of these quaternions, if quaternions differing by a real factor are 
identified, is isomorphic with the group of transformations of the Lobatchefskian plane 
into itself. That the metric of R remains invariant under the transformations given in 
the text follows immediately from the equation norm(uv) =norm(«)-norm(u). 
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of projective coordinates 15 u^uiv^us so chosen that Klein’s fundamental 
quadric takes on the form | u 0 2 + ui 2 — u 2 2 — u 3 2 . Then, any motion u —*• u' 450 
of R into itself can be represented in the form u' — p-u-q, where p and q are 
hyperbolic quaternions of positive norm. A system ir of Clifford parallels 
can be represented by a a ■ u, where a is a hyperbolic quaternion depending 
on 7r alone, and the individual lines of 7r are obtained by assigning a fixed 
value to u and varying a from —oo to +oo. It follows that those motions 
of R into itself which leave w (and the orientation of its lines) invariant are 
represented by u' = • u ■ q, where /? varies over all real numbers and q 

over all hyperbolic quaternions of positive norm. These motions, however, 
evidently form the four-parametric continuous group of transformations 
which carry So into itself. The lines of 7r, of course, are the world lines of 
matter. 

Evidently, in whatever ratio p, the metric of R is stretched in the di¬ 
rection of the lines of 7r, the resulting space R' has rotational symmetry. 
Therefore, the contracted Riemann tensor of R! x l ( l being a straight 
line), if the coordinate system in the point considered is orthonormal, and 
its first basis vector has the direction of the 7r-lines, its last one 
the direction of l, has the form 



where a and b are functions of p. Computation shows that p = \/2 yields 
6 = 0, i.e., Rik = a ■ which makes it possible to satisfy the field 

equations in the manner described above. 

As to the physical meaning of the solution proposed in this paper, it is 
clear that it yields no red shift for distant objects. For, by using the trans¬ 
formation (I) defined in the proof of the properties (1) and (2), one proves 
immediately that light signals sent from one particle of matter (occurring 
in the solution) to another one arrive with the same time intervals in which 
they are sent. For the period of rotation one obtains 2 • 10 11 years, if for 
p the value of 10 _3O g/cm 3 is substituted. Assuming galactical systems 
were formed by condensation of matter originally distributed uniformly, 
and taking for the ratio of contraction 1:200 (which is suggested by the 


15 It is to be noted, however, that there exist different topological forms of spaces 
of constant positive curvature and signature —1, and that that form which can be rep¬ 
resented in projective coordinates in a one-to-one manner does not lead exactly to the 
space S defined before, but rather to a space obtained from S by identifying any two 
points which are situated on the same line of the system 7r and whose distance on that 
line is equal to a certain constant. A corresponding difference subsists for the groups of 
transformations. 
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observed average ratio of 1:200 between diameter and distance of galaxies), 
one obtains (using the law of conservation of angular momentum) for the 
average period of rotation of galactic systems 5 • 10 6 years. This number is 
of the correct order of magnitude, but, in view of the fact that this would 
have to be approximately the period of rotation in the outer parts of the 
nebulae, the observed value is found to be considerably larger. 16 Of course 
such comparison with observation has very little significance before an ex¬ 
pansion has been combined with the rotation. Moreover, an explanation 
would have to be found for the apparent irregularity of the distribution of 
the axes of rotation of the galaxies. This, however, is perhaps not impos¬ 
sible, since there exist various circumstances which would tend to blur the 
original order, or make it appear blurred, especially if the axes of rotation 
of matter in different places (unlike in the solution described above) were 
not parallel with each other. The radius of the smallest time-like circles, in 
the solution given in this paper, is of the same order of magnitude as the 
world radius in Einstein’s static universe. 


16 From the numerical data which E. Hubble {1934, P- 74) gives about two galaxies 
of medium size follow periods of rotation of 2 • 10 7 and 7 ■ 10' years at a distance of 
about half the radius from the center. The period of rotation of the Andromeda Nebula 
in the central region is estimated at 1.5 -10 7 years. 




Introductory note to 1949a 


This paper, written for a collection intended to honor and to dis¬ 
cuss the work of Einstein, appears to be the only published piece by 
Godel that deals with philosophical issues not directly concerned with 
mathematics. In it Godel argues, on the basis of the very interesting 
cosmological solutions of Einstein’s general-relativistic field equations 
obtained by him (1949), that those philosophers are right who have de¬ 
nied the “objectivity of change” . a 

A caution seems in order concerning the use in the title of this paper 
of the phrase “idealistic philosophy”. The word “idealism” has been used 
historically in connection with a very diverse class of metaphysical views, 
whose common characteristic is the claim that what is ultimately “real” 
is something fundamentally “mental”. By no means all such philosophies 
have denied the objectivity of change—for change may be attributed to 
minds or their contents. On the other hand, the contention that change 
is not objective, but is in some sense a “mere appearance”, need not 
be associated with the view that all that is real is mental; and, indeed, 
it is far from plain in Godel’s paper that the latter is his own view, 
since he bases his argument on the physical possibility (“compatibility 
with the laws of nature”) of worlds in which temporal relations have 
the bizarre characteristics he describes: thus his conclusion seems to be, 
not that the world of physics is grounded in something “mental”, but 
that our conception of the world as changing is subjective or illusory - a 
contribution of our minds. 


a One point in Godel’s discussion of his cosmological solutions perhaps deserves 
comment, although it does not substantially affect the argument of the paper. Godel 
defends these solutions against a charge of absurdity by the consideration that the 
“time travel” that is physically possible in the “worlds” they describe would not 
be practically possible. Others have in fact rejected these solutions as “unphysical” 
because of the possibility of time travel. But it is hard to see the cogency of this 
rejection—or the need for the defense Godel offers. Objections of the type “What if 
I were to go back and, for example, murder my own younger self?” admit a perfectly 
straightforward answer: in a cosmos of the sort in question, that act would simply 
not be possible. It would seem, in fact, that such a cosmos would have to be regarded 
as fully deterministic—or fully determinate; but Godel’s own argument against “the 
objectivity of change” leads in any case to determinateness as characteristic of things. 
And, after all, classical physics was generally conceived as deterministic. However 
obnoxious this notion has been to some philosophies, the objection “But I can always 
choose whether or not, for instance, to lift my arm” has never really carried any 
weight; and the objection raised in the context of time travel, although in some sense 
more poignant, is fundamentally of the same type. 
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This conclusion has a distinct relation to the position of Kant, to 
whom Godel refers and who himself repudiated metaphysical idealism, 
but asserted what he called the “transcendental ideality” of time (as 
well as of space). The force of this assertion was that spatial and tem¬ 
poral attributes fundamentally characterize, not things “as they are in 
themselves”, but a certain relation of those things to us—to our faculties 
of perception and representation. In particular, it is the special consti¬ 
tution of these latter faculties, according to Kant, that is responsible 
for the general structures of space and time that form the subject of 
geometry and of what may be called “pure chronometry”; in this sense, 
Kant characterizes these disciplines as concerned with the “pure form” 
of our “intuitive” (that is, our receptive or sensitive) faculty. As to the 
issue of metaphysical idealism, Kant rejects altogether any claims to 
knowledge of what things are apart from our experience (what they are 
“in themselves”); but within experience—that is, within the entire field 
of what can in any way be known—the structures of space and time 
by his doctrine are fully objective; as “forms” that condition the very 
possibility of perception, they constitute a universal framework for all 
objective scientific knowledge. Thus, affirming the “transcendental ide¬ 
ality” of space and time, Kant as emphatically asserts their “empirical 
reality”. Furthermore, on his doctrine, these universal and empirically 
real structures can be known independently of experience, just because 
they are effects of our own constitution and are conditions of all possible 
experience. 

On this latter point, it is evident that Godel cannot adhere to Kant’s 
view, since his own examples depart radically from the structure Kant 
thought necessary a priori for science, and since, far from claiming a 
grounding in something like Kant’s “pure intuition”, he emphasizes the 
“astonishing” and “strange” character of the results that form the basis 
of his argument, and their departure from “the intuitive idea” of an abso¬ 
lute and objective lapse of time. Further fight is thrown on the question 
of Godel’s own conception of the relation of his view to Kant’s by an as 
yet unpublished manuscript (found in his Nachlass, and bearing the title 
“Some observations about the relationship between theory of relativity 
and Kantian philosophy”), which discusses that relation in some detail 
and which makes explicit that a central difference from Kant concerns 
just this point: Kant, in Godel’s opinion, overemphasized in his episte¬ 
mological discussion the dependence of spatiotemporal structure upon 
our faculty of representation, and was led by this into two errors—he 
concluded, erroneously, that the temporal properties of things (perhaps 
one should rather say, “of events”) must be the same for all human be¬ 
ings (since human beings all have the same species of representational 
faculty); and he failed to see that geometry is at least in one sense an 
empirical science. 
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Thus, so far as Kantian philosophy is concerned, the principal anal¬ 
ogy that Godel has in mind between it and relativity theory concerns the 
strong sense in which temporal properties become (in general) well de¬ 
fined only relative to certain structures within the world: the world-lines 
of bodies. In the case of the bodies of sentient beings, these world-lines 
are also the loci of their immediate sensual contact with the reality out¬ 
side themselves—so that in this special ease the relation can be said 
to be “to the sensibility of the observer”. This relativity of course af¬ 
fects the notion of “change” or “passage”, centrally emphasized in the 
present paper. But to be relative is not to be illusory: in this paper, 
Godel speaks of “an unequivocal proof for the view of those philosophers 
who ... consider change as an illusion due to our special mode of percep¬ 
tion” , whereas in the manuscript referred to (which does not explicitly 
mention “idealistic philosophy”) he puts great stress upon the objective 
character of the relations in question. 

In view of Godel’s well-known, long-standing, and deep interest in 
philosophical matters, it is cause for great regret that what we thus far 
possess of his reflections on such matters is so meager. The apparent 
discrepancy just noted between the present paper and the unpublished 
manuscript makes one wish both to know their comparative dates (if 
that can be determined) and to obtain further light—perhaps recon¬ 
ciling what appears discrepant—upon the metaphysical view here too 
briefly adumbrated. There is even greater need for clarification of the 
epistemological side of these views: we have so far in print only a brief 
comment, quoted (from a letter) in Greenberg 1980 (page 250), sug¬ 
gesting that Godel did believe that we have in some sense an a priori 
“physical intuition” of spatial structure “in the small”; and some enig¬ 
matic but intriguing remarks (Godel 1964, pages 271- 272, and Wang 
1974, pages 84-85) about the relation to reality of human knowledge. It 
is very much to be hoped that the materials found in the Godel Nachlass 
will help to illuminate our understanding of his philosophical position. 


Howard Stein 




A remark about the relationship between 
relativity theory and idealistic philosophy 
(1949a) 


One of the most interesting aspects of relativity theory for the philo¬ 
sophical-minded consists in the fact that it gave new and surprising in¬ 
sights into the nature of time, of that mysterious and seemingly self¬ 
contradictory 1 being which, on the other hand, seems to form the basis 
of the world’s and our own existence. The very starting point of special 
relativity theory consists in the discovery of a new and very astonishing 
property of time, namely the relativity of simultaneity, which to a large 
extent implies 2 that of succession. The assertion that the events A and B 
are simultaneous (and, for a large class of pairs of events, also the assertion 
that A happened before B) loses its objective meaning, in so far as another 
observer, with the same claim to correctness, can assert that A and B are 
not simultaneous (or that B happened before A). 

Following up the consequences of this strange state of affairs, one is led 
to conclusions about the nature of time which are very far reaching indeed. 
In short, it seems that one obtains an unequivocal proof for the view of 
those philosophers who, like Parmenides, Kant, and the modern idealists, 
deny the objectivity of change and consider change as an illusion or an 
558 appearance due to our special mode of perception. 3 The argu|ment runs 
as follows: Change becomes possible only through the lapse of time. The 
existence of an objective lapse of time, 4 however, means (or, at least, is 
equivalent to the fact) that reality consists of an infinity of layers of “now” 


1 Cf., e.g., McTaggart 1908. 

2 At least if it is required that any two point events are either simultaneous or one 
succeeds the other, i.e., that temporal succession defines a complete linear ordering of 
all point events. There exists an absolute partial ordering. 

3 Kant, in the Critique of pure reason ( 1 7S7, p. 54), expresses this view in the following 
words: “Those affections which we represent to ourselves as changes, in beings with 
other forms of cognition, would give rise to a perception in which the idea of time, and 
therefore also of change, would not occur at all.” This formulation agrees so well with 
the situation subsisting in relativity theory that one is almost tempted to add: such as. 
e.g., a perception of the inclination relative to each other of the world lines of matter in 
Minkowski space. 

4 One may take the standpoint that the idea of an objective lapse of time (whose 
essence is that only the present really exists) is meaningless. But this is no way out 
of the dilemma; for by this very opinion one would take the idealistic viewpoint as to 
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which come into existence successively. But, if simultaneity is something 
relative in the sense just explained, reality cannot be split up into such 
layers in an objectively determined way. Each observer has his own set of 
“nows”, and none of these various systems of layers can claim the prerog¬ 
ative of representing the objective lapse of time. 5 

This inference has been pointed out by some, although by surprisingly 
few, philosophical writers, but it has not remained | unchallenged. And ac- 559 
tually to the argument in the form just presented it can be objected that the 
complete equivalence of all observers moving with different (but uniform) 
velocities, which is the essential point in it, subsists only in the abstract 
space-time scheme of special relativity theory and in certain empty worlds 
of general relativity theory. The existence of matter, however, as well as 
the particular kind of curvature of space-time produced by it, largely de¬ 
stroys the equivalence of different observers 6 and distinguishes some of 


the idea of change, exactly as those philosophers who consider it as self-contradictory. 
For in both views one denies that an objective lapse of time is a possible state of affairs, 
a fortiori that it exists in reality, and it makes very little difference in this context, 
whether our idea of it is regarded as meaningless or as self-contradictory. Of course, for 
those who take either one of these two viewpoints the argument from relativity theory 
given below is unnecessary, but even for them it should be of interest that perhaps there 
exists a second proof for the unreality of change based on entirely different grounds, 
especially in view of the fact that the assertion to be proved runs so completely counter 
to common sense. A particularly clear discussion of the subject independent of relativity 
theory is to be found in Mongre 1898. 

5 It may be objected that this argument only shows that the lapse of time is something- 
relative, which does not exclude that it is something objective, whereas idealists maintain 
that it is something merely imagined. A relative lapse of time, however, if any meaning 
at all can be given to this phrase, would certainly be something entirely different from the 
lapse of time in the ordinary sense, which means a change in the existing. The concept 
of existence, however, cannot be relativized without destroying its meaning completely. 
It may furthermore be objected that the argument under consideration only shows that 
time lapses in different ways for different observers, whereas the lapse of time itself may 
nevertheless be an intrinsic (absolute) property of time or of reality. A lapse of time, 
however, which is not a lapse in some definite way seems to me as absurd as a colored 
object which has no definite colors. But, even if such a thing were conceivable, it would 
again be something totally different from the intuitive idea of the lapse of time to which 
the idealistic assertion refers. 

6 Of course, according to relativity theory all observers are equivalent in so far as the 
laws of motion and interaction for matter and field are the same for all of them. But this 
does not exclude that the structure of the world (i.e., the actual arrangement of matter, 
motion, and field) may offer quite different aspects to different observers, and that it 
may offer a more “natural” aspect to some of them and a distorted one to others. The 
observer, incidentally, plays no essential role in these considerations. The main point, 
of course, is that the [four-dimensional] world itself has certain distinguished directions, 
which directly define certain distinguished local times. 
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them conspicuously from the rest, namely, those which follow in their mo¬ 
tion the mean motion of matter. 7 Now in all cosmological solutions of the 
gravitational equations (i.e., in all possible universes) known at present 
the local times of all these observers fit together into one world time, so 
that apparently it becomes possible to consider this time as the "true" one. 
which lapses objectively, whereas the discrepancies of the measuring results 
of other observers from this time may be conceived as due to the influence 
which a motion relative to the mean state of motion of matter has on the 
measuring processes and physical processes in general. 

From this state of affairs, in view of the fact that some of the known 
cosmological solutions seem to represent our world correctly. James Jeans 
has concluded 8 that there is no reason to abandon the intuitive idea of an 
absolute time lapsing objectively. I do not think that the situation justifies 
560 this conclusion and am basing my opinion chiefly 9 on the following facts 
and considerations: 

There exist cosmological solutions of another kind 10 than those known 
at present, to which the aforementioned procedure of defining an absolute 
time is not applicable, because the local times of the special observers 
used above cannot be fitted together into one world time. Nor can any 
other procedure which would accomplish this purpose exist for them: i.e.. 
these worlds possess such properties of symmetry that for each possible 
concept of simultaneity and succession there exist others which cannot be 
distinguished from it by any intrinsic properties, but only by reference to 
individual objects, such as. e.g.. a particular galactic system. 


7 The value of the mean motion of matter may depend essentially on the size of the 
regions over which the mean is taken. What may be called the "true mean motion" is 
obtained by taking regions so large that a further increase in their size does not any 
longer change essentially the value obtained. In our world this is the case for regions 
including many galactic systems. Of course a true mean motion in this sense need not 
necessarily exist. 

8 Cf. Jeans 1936. pp. 22-23. 

9 Another circumstance invalidating Jeans' argument is that the procedure described 
above gives only an approximate definition of an absolute time. No doubt it is possible to 
refine the procedure so as to obtain a precise definition, but perhaps only by introducing 
more or less arbitrary elements (such as. e.g.. the size of the regions or the weight function 
to be used in the computation of the mean motion of matter). It is doubtful whether 
there exists a precise definition which has so great merits that there would be sufficient 
reason to consider exactly the time thus obtained as the true one. 

10 The most conspicuous physical property distinguishing these solutions from those 
known at present is that the compass of inertia in them everywhere rotates [in the same 
direction] relative to matter, which in our world would mean that it rotates relative to 
the totality of galactic systems. These worlds, therefore, can fittingly be called "rotating 
universes". In the subsequent considerations I have in mind a particular kind of rotating 
universes which have the additional properties of being static and spatially homogeneous, 
and a cosmological constant < 0. For the mathematical representation of these solutions, 
cf. my forthcoming 1949 [and. for a general discussion of rotating universes, niy 1952], 
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Consequently, the inference drawn above as to the non-objectivity of 
change doubtless applies at least in these worlds. Moreover it turns out 
that temporal conditions in these universes (at least in those referred to 
in the end of footnote 10) show other surprising features, strengthening 
further the idealistic viewpoint. Namely, by making a round trip on a 
rocket ship in a sufficiently wide curve, it is possible in these worlds to 
travel into any region of the past, present, and future, and back again, 
exactly as it is possible in other worlds to travel to distant parts of space. 

This state of affairs seems to imply an absurdity. For it enables one. e.g.. 
to travel into the near past of those places where | he has himself lived. 
There he would find a person who would be himself at some earlier period 
of his life. Now he could do something to this person which, by his memory, 
he knows has not happened to him. This and similar contradictions, how¬ 
ever. in order to prove the impossibility of the worlds under consideration, 
presuppose the actual feasibility of the journey into one's own past. But 
the velocities which would be necessary in order to complete the voyage in a 
reasonable length of time 11 are far beyond everything that can be expected 
ever to become a practical possibility. Therefore it cannot be excluded a 
priori, on the ground of the argument given, that the space-time structure 
of the real world is of the type described. 

As to the conclusions which could be drawn from the state of affairs 
explained for the question being considered in this paper, the decisive point 
is this: that for every possible definition of a world time one could travel 
into regions of the universe which are past according to that definition. 12 
This again shows that to assume an objective lapse of time would lose every 
justification in these worlds. For. in whatever way one may assume time to 
be lapsing, there will always exist possible observers to whose experienced 
lapse of time no objective lapse corresponds (in particular also possible 


11 Basing the calculation on a mean density of matter equal to that observed in our 
world, and assuming one were able to transform matter completely into energy, the 
weight of the "fuer of the rocket ship, in order to complete the voyage in t years (as 
measured by the traveller), would have to be of the order of magnitude of 10 22 jt 2 times 
the weight of the ship (if stopping, too. is effected by recoil). This estimate applies to 
t -C 10 11 . Irrespective of the value of t. the velocity of the ship must be at least l/\/2 
of the velocity of light. 

[Translation of the author's addition to the German edition (1955): A second reason 
for excluding a priori the universes mentioned above could be found in the possibility of 
"telegraphing a message into one's own past''. But the practical difficulties in doing so 
would hardly seem to be trifling. Moreover, the boundary between difficulties in practice 
and difficulties in principle is not at all fixed. What was earlier a practical difficulty in 
atomic physics has today become an impossibility in principle, in consequence of the 
uncertainty principle: and the same could one day happen also for those difficulties that 
reside not in the domain of the "too small". but of the "too large".] 

12 For this purpose incomparably smaller velocities would be sufficient. Under the 
assumptions made in footnote 11 the weight of the fuel would have to be at most of the 
same order of magnitude as the weight of the ship. 
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observers whose whole existence objectively would be simultaneous). But, 
if the experience of the lapse of time can exist without an objective lapse 
of time, no reason can be given why an objective lapse of time should be 
assumed at all. 

It might, however, be asked: Of what use is it if such conditions prevail 
in certain possible worlds? Does that mean anything for the question in- 
562 teresting us whether in our world there | exists an objective lapse of time? 
I think it does. For: (1) Our world, it is true, can hardly be represented by 
the particular kind of rotating solutions referred to above (because these 
solutions are static and, therefore, yield no red-shift for distant objects); 
there exist however also expanding rotating solutions. In such universes 
an absolute time also might fail to exist, 13 and it is not impossible that 
our world is a universe of this kind. (2) The mere compatibility with the 
laws of nature 14 of worlds in which there is no distinguished absolute time, 
and [in which], therefore, no objective lapse of time can exist, throws some 
light on the meaning of time also in those worlds in which an absolute time 
can be defined. For, if someone asserts that this absolute time is lapsing, 


13 At least if it required that successive experiences of one observer should never be 
simultaneous in the absolute time or (which is equivalent) that the absolute time should 
agree in direction with the times of all possible observers. Without this requirement 
an absolute time always exists in an expanding (and homogeneous) world. Whenever 
I speak of an “absolute” time, this of course is to be understood with the restriction 
explained in footnote 9, which also applies to other possible definitions of an absolute 

[Translation of the author’s addition to the German edition (1955): By an “absolute 
time” I understand a world time that can be defined without reference to particular 
objects and that satisfies the requirement formulated at the beginning of this footnote. 
More precisely, this should be called a “possible absolute time”, since several can exist 
within one world, even though that is only exceptionally the case in spatially homoge¬ 
neous universes.] 

14 The solution considered above only proves the compatibility with the general form 
of the field equations in which the value of the cosmological constant is left open; this 
value, however, which at present is not known with certainty, evidently forms part of the 
laws of nature. But other rotating solutions might make the result independent of the 
value of the cosmological constant (or rather of its vanishing or non-vanishing and of its 
sign, since its numerical value is of no consequence for this problem). At any rate these 
questions would first have to be answered in an unfavorable sense before one could think 
of drawing a conclusion like that of Jeans mentioned above. Note added 2 September 
1949: I have found in the meantime that for every value of the cosmological constant 
there do exist solutions in which there is no world time satisfying the requirement of 
footnote 13. 

[Translation of the author’s addition to the German edition (1955): The second law of 
thermodynamics would also seem to be compatible with the solutions above. For within 
them a positive direction can be defined for all time-like lines in a unique and continuous 
way. Furthermore, the probability of any material system returning exactly to a former 
state is vanishingly small; and, if that happens only approximately, it merely means that 
somewhere two examples of the same system (in general having different entropies) exist 
simultaneously side by side. Of course, the initial conditions in such worlds cannot be 
chosen entirely freely.] 
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he accepts as a consequence that whether or not an objective lapse of time 
exists (i.e., whether or not a time in the ordinary sense of the word exists) 
depends on the particular way in which matter and its motion are arranged 
in the world. This is not a straightforward contradiction; nevertheless, a 
philosophical view leading to such consequences can hardly be considered 
as satisfactory. 



Rotating universes in general relativity theory 
( 1952 ) 


[The introductory note to this paper and to 1949 is found on page 189, 
immediately preceding 1949.J 

In this lecture I am setting forth the main results (for the most part 
without proofs) to which my investigations on rotating universes have led 
me so far. 


1. Definition of the type of rotatory solutions 
to be considered 

I am starting from the relativistic field equations: 1 

(1) Rik - ^ 9ikR = T ik - A g ik 

and am assuming that: 

1) the relative velocity of masses (i.e., galactic systems) close to each 
other is small compared with c; 

2) no other forces except gravitation come into play. 

Under these assumptions T& takes on the form: 


(2) Tik — P v i v k 
where: 

(3) p > 0, 

(4) g ik ViV k = -1, 
and, of course: 

(5) The signature of gik is + 2. 

The local angular velocity of matter relative to the compass of inertia 


1 1 am supposing that 
Surk/c 2 = 1. 


units are introduced as make c = 1, 
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can be represented by the following vector uj (which is always orthogonal 
on v ): 


( 6 ) 




where the skew-symmetric tensor aki m is defined by: 


( 7 ) 


0>klm = v k 


( dVl 

\dx rn 


dv m \ fdv m dvk\ f dvk dvi \ 
dxi ) + V>1 \ dxk dx m ) + Vm \ dxi dx k ) 


That ui represents the angular velocity relative to the compass of in¬ 
ertia is seen as follows: In a coordinate system which, in its origin, is 
geodesic and normal, and in whose origin matter is at rest (i.e., for which in 
O: dgik/dxi — 0, gtk = Riki v 4 — 1, v' = 0 for i ± 4), 2 one obtains for w* 
in O : 


(df_ 




(9) co 4 = 0. 

| In such a coordinate system, however, since parallel displacement (in its 
origin) means constancy of the components, the angular velocity relative 
to the compass of inertia, in O, is given by the same expressions as in 
Newtonian physics, i.e. the right-hand sides of (8) are its components. Ev¬ 
idently uj is the only vector the first 3 components of which, in the particular 
coordinate systems defined, coincide with the angular velocity computed as 
in Newtonian physics and the 4th component is 0. 

Any Riemann 4-space with some p, Vi defined in it, which everywhere 
satisfies the conditions (l)-(5) and permits of no extension free from sin¬ 
gularities, and for which, moreover, u> is continuous and / 0 in every point, 
represents a rotating universe. However, in the sequel I am chiefly con¬ 
cerned with solutions satisfying the following three further postulates (sug¬ 
gested both by observation and theory): 

I. The solution is to be homogeneous in space (i.e., for any two world 
lines of matter l, m there is to exist a transformation of the solution into 
itself which carries l into m). 

II. Space is to be finite (i.e., the topological space whose points are the 
world lines of matter is to be closed, i.e., compact). 

III. p is not to be a constant. 


2 A coordinate system satisfying the first two conditions may fittingly be called a 
“local inertial system”. 
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Postulate III is indispensable also for rotating universes, since it can be 
proved that a red-shift which, for small distances, increases linearly with 
the distance implies an expansion, no matter whether the universe rotates 
or not, 3 

As to the question of the existence of rotating solutions satisfying the 
postulates I, II, III, cf. §5. 


2. Some general properties of these solutions 

In view of III the equation p = constant defines a one-parameter system 
of 3-spaces. In rotating universes these 3-spaces of constant density cannot 
he orthogonal on the world lines of matter. This follows immediately from 
the fact that aki m = 0 is the necessary and sufficient condition for the 
existence of any system of 3-spaces orthogonal on a vector field v. 

The inclination of the world lines of matter toward the spaces of constant 
density yields a directly observable necessary and sufficient criterion for the 
rotation of an expanding spatially homogeneous and finite universe: namely, 
for sufficiently great distances, there must be more galaxies in one half of 
the sky than in the other half. 

In the first approximation, i.e., for solutions differing little from one 
spatially isotropic, the magnitude of this effect is given by the following 
theorem: If N \, N% are the numbers of galaxies in the two hemispheres 
into which a spatial sphere 4 of radius r (small compared with the world 
177 radius R) is decomposed by a \ plane orthogonal on ui, then: 

|JVi - N 2 \ 9 M rRh 

' J AIi + AT 2 8' c 2 ’ 

where h is Hubble’s constant (=R/R). 

For plausible values of the constants (where oj is estimated from the 
velocity of rotation of the galaxies 5 ) this effect is extremely small. But the 
uncertainty in the knowledge of the constants is too great for drawing any 
definitive conclusions. 

The group of transformations existing owing to I evidently carries each of 
the spaces p = constant into itself, and therefore (the case of isotropy being 


3 Provided, of course, that the atomic constants do not vary in time and space, or, 
to be more exact, provided that the dimensionless numbers definable in terms of the 
constants of nature (such as e 2 /he) are the same everywhere. 

4 I.e., one situated in a 3-space orthogonal on v at the point under consideration. 

5 Cf. my 194.9 , p. 450. 
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excluded) can only have 3 or 4 parameters. 6 The number 4 (i.e., the case 
of rotational symmetry) cannot occur either. There exist no rotationally 
symmetric rotating universes satisfying the conditions stated in §1. 7 The 
only symmetry around one point which can occur is that of one rotation 
by 7 t. This case will be referred to as the symmetric one. 

In any case the group of transformations must be 3-parameter. Since 
moreover, owing to II, it must be compact, and since (as can easily be 
shown) it cannot be commutative in rotating universes, 8 it follows that the 
group of transformations of any rotating solution of the type characterized 
in §1 must be isomorphic (as a group of transformations) with the right (or 
the left ) translations of a 3-space of constant positive curvature, or with 
these translations plus certain rotations by an angle n. Hence also the 
topological connectivity of space must be that of a spherical or elliptical 
3-space. 

The metric gik can be decomposed (relative to the world lines of mat¬ 
ter) into a space-metric gik and a time-metric gik, by defining the spatial 
distance of two neighboring points Pi, P2 to be the orthogonal distance of 
the two world lines of matter passing through Pi, P2, and the temporal 
distance to be the orthogonal projection of P1P2 on one of these two lines. 
This decomposition evidently is exactly that which (in the small) holds for 
the observers moving along the world lines of matter. It has the following 
properties: 

(11) W = -Viv k , giU = + Viv k , 


Det (g ik ) = Det (g ik ) = 0. 

If the coordinate system is so chosen that the a;4-lines are the world 
lines of matter and the x 4 -coordinate measures the length of these lines, 
gik takes on the form: 


( 12 ) 


° II 

° II 


| (where hik is positive definite) and the Hubble-constant in the space- 178 
direction dx l (orthogonal on v), as measured by an observer moving along 
with matter, becomes equal to: 


1 hikdx l dx k 

2 h i kdx i dx k ’ 


where hik 


dhj k 

dx 4 


6 There exists, in every space p = constant, a positive definite metric which is carried 
into itself, namely the metric hik defined below. 

7 This even is true irrespective of postulate II (the finiteness of space). 

8 The reason is that the curl of a vector field invariant under a transitive commutative 
group vanishes identically. 
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The surface hikXiXk = 1 in the 3-dimensional subspace, orthogonal on v. 
of the tangent space, may be called the ellipsoid of expansion or, more 
generally, the quadric of expansion. 

The theorem about the nonexistence of rotationally symmetric solu¬ 
tions, 9 under the additional hypothesis that the universe contains no closed 
time-like lines (cf. §3), can be strengthened to the statement that the 
quadric of expansion, at no moment of time, can be rotationally symmet¬ 
ric around u). In particular it can never be a sphere, i.e., the expansion 
is necessarily coupled with a deformation. This even is true for all solu¬ 
tions satisfying I III and gives another directly observable property of the 
rotating universes of this type. 

Moreover the asymmetry of the expansion around w opens up a possibil¬ 
ity for the explanation of the spiral structure of the galaxies. For, if under 
these circumstances a condensation is formed, the chances are that it will 
become an oblong body rotating around one of its smaller axes; and such 
a body, because its outer parts will rotate more slowly, will, in the course 
of time, be bent into a spiral. It remains to be seen whether a quantitative 
elaboration of this theory of the formation of spirals will lead to agreement 
with observation. 


3. Rotation and time-metric 

The formulae (6), (7), (11) show that it is, in the first place, the time- 
metric (relative to the observers moving along with matter) which deter¬ 
mines the behavior of the compass of inertia. In fact a necessary and 
sufficient condition for a spatially homogeneous universe to rotate is that 
the local simultaneity of the observers moving along with matter be not inte- 
grable (i.e., do not define a simultaneity in the large). This property of the 
time-metric in rotating universes is closely connected with the possibility 
of closed time-like lines. 

The latter anomaly, however, occurs only if the angular velocity sur¬ 
passes a certain limit. This limit, roughly speaking, is that value of |w| 
for which the maximum linear velocity caused by the rotation becomes 
equal to c; i.e., it is approximately c/R if, at the moment considered, the 
space-metric in the 3-space p — constant does not differ too much from a 
space of the constant curvature 1/R 2 . The precise necessary and sufficient 
condition for the nonexistence of closed time-like lines (provided that the 
179 one-parameter manifold of the spaces p — | constant is not closed) is that 


9 This theorem makes it very likely that there exist no rotating spatially homogeneous 
and expanding solutions whatsoever in which the ellipsoid of expansion is permanently 
rotationally symmetric around w. 





Rotating universes in general relativity theory 


213 


the metric in the spaces of constant density be space-like. 10 This holds for 
solutions satisfying all conditions stated in §1. 

For these solutions, also, the nonexistence of closed time-like lines is 
equivalent with the existence of a “world-time”, where by a world-time we 
mean an assignment of a real number t to every space-time point, which 
has the property that t always increases if one moves along a time-like line 
in its positive direction. 11 If in addition any two 3-spaces of simultaneity 
are equidistant and the difference of t is their distance, one may call it 
a metric world-time. If the spaces of constant density are space-like, a 
metric world-time can be defined by taking these 3-spaces as spaces of 
simultaneity. Evidently (up to transformations t = f(t)) this is the only 
world-time invariant under the group of transformations of the solution. 


4. Behavior of the angular velocity 
in the course of the expansion 


No matter whether postulates I—III are satisfied or not, the temporal 
change of lj is described by the following theorem: In a coordinate system 
in which the x 4 -lines are the world lines of matter, g 44 = — 1 everywhere, 
and moreover ga = 0 (for i / 4) on the X 4 -axis, one has along the whole 
X4 -axis: 

(13) w*(—3) 1 / 2 = cj’/i 1 / 2 = constant (i = 1,2,3). 


The proof can be given in a few lines: Evidently w 4 = 1, v l = 0 (for 
i ^ 4) everywhere; hence: Vi — ga- Substituting these values of in (7), 
one obtains on X 4 : 


(14) 


_ dg 4k _ dg 4i 
auk ~ dxi dx k ’ 


But dg4i/dx 4 — 0 (because the a^-lines are geodesics and g 44 — 
— 1). Hence by (14), dakim/dx 4 = 0 on X 4 . Hence by (6) also, 
0(w‘(-0) 1/2 )/a* 4 = 0 on X 4 . 

The equation (13) means two things: 


10 This condition, too, means that at the border separating the two cases the linear 
velocity caused by the rotation becomes equal to c, if by this linear velocity is understood 
the velocity of matter relative to the orthogonals on the spaces of constant density. 

X1 A time-like vector is positive if it is contained in the same half of the light-cone as 
the vector v. 
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A. that the vector uj (or, to be more exact, the lines l w whose tangent 
everywhere has the direction uj) permanently connects the same particles 
with each other; 

B. that the absolute value | u>\ increases or decreases in proportion to the 
contraction or expansion of matter orthogonal on uj, where this contraction 
or expansion is measured by the area of the intersection of an infinitesimal 
spatial cylinder 4 around (permanently including the same particles) with 
a surface orthogonal on l u . 

Since in the proof of (13) nothing was used except the fact that the 
180 world | lines of matter are geodesics (and in particular the homogeneity of 
space was not used), (13), and therefore A, B, also describe the behavior 
of the angular velocity, if condensations are formed under the influence of 
gravitation; 12 i.e., |u?|, under these circumstances, increases by the same 
law as in Newtonian mechanics. 

The direction of uj, even in a homogeneous universe, need not be dis¬ 
placed parallel to itself along the world lines of matter. The necessary and 
sufficient condition for it to be displaced parallel at a certain moment is 
that it coincide with one of the principal axes of the quadric of expansion. 
For, if P, Q are two neighboring particles connected by uj, then, only under 
the condition just formulated, the direction PQ at the given moment, will 
be at rest relative to the compass of inertia (in order to see this one only 
has to introduce the local inertial system defined in §1 (cf. footnote 2) and 
then argue exactly as in Newtonian physics). Since however (because of 
A) the direction of u> coincides permanently with the direction of PQ, the 
same condition applies for the direction of lu. This condition however, in 
general, is not satisfied (only in the symmetric case it is always satisfied). 

The fact that the direction of u> need not be displaced parallel to itself 
might be the reason for the irregular distribution of the directions of the 
axes of rotation of the galaxies (which at first sight seems to contradict an 
explanation of the rotation of the galaxies from a rotation of the universe). 
For, if the axis of rotation of the universe is not displaced parallel, the 
direction of the angular momentum of a galaxy will depend on the moment 
of time at which it was formed. 


5. Existence theorems 

It can be shown that, for any value of A (including 0), there exist oc 8 
rotating solutions satisfying all conditions stated in §1. The same is true 
if in addition it is required that a world-time should exist (or should not 


12 Of course, only as long as the gas and radiation pressure remain small enough to 
be neglected. 
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exist). The value of the angular velocity is quite arbitrary, even if p and 
the mean world radius (at the moment under consideration) are given. In 
particular, there exist rotating solutions with A = 0 which differ arbitrarily 
little from the spatially isotropic solution with A — 0. 

Thus the problem arises of distinguishing, by properties of symmetry or 
simplicity, certain solutions in this vast manifold of solutions. E.g., one 
might try to require that the universe should expand from one point and 
contract to one point. 


6. Method of proof 

The method of proof by which the results given above were obtained is 
based on postulate I of §1. This postulate implies that all world lines of 
matter (and all orthogonals on the spaces of constant density) are equiva¬ 
lent with each other. It is, therefore, sufficient to confine the consideration 
to one | such world line (or one such orthogonal). This reduces the problem 181 
to a system of ordinary differential equations. 

Moreover, this system of differential equations can be derived from a 
Hamiltonian principle, i.e., it is a problem of analytical mechanics with 
a finite number of degrees of freedom. The equations of relativity theory, 
however, assign definite values to the integrals of energy and momentum, so 
that the relativistic problem is a little more special than the corresponding 
one of analytical mechanics. 

The symmetric case, by means of the integrals of momentum, can be 
reduced to a problem with three degrees of freedom (<7i, <72)03 )> whose 
Lagrangian function reads as follows: 



where g = 0 i 0 2 03 and V is a constant which determines the velocity of rota¬ 
tion. The general case can be reduced to a system of differential equations 
of the 8th order. 


7. Stationary rotating solutions 

It might be suspected that the desired particular solutions (cf. §5 above) 
will have a close relationship to the stationary homogeneous solutions, and 
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it is therefore of interest to investigate these, too. By a stationary homo¬ 
geneous solution we mean one whose group, for any two points P, Q of the 
whole 4-space, contains transformations carrying P into Q. 

These solutions can all be determined and expressed by elementary func¬ 
tions. One thus obtains the following results: 

1. There exist no stationary homogeneous solutions with A = 0. 

2. There exist rotating stationary homogeneous solutions with finite 
space, no closed time-like lines, and A > 0; in particular also such as differ 
arbitrarily little from Einstein’s static universe. 

The world lines of matter in these solutions, however, are not equidistant: 
neighboring particles of matter, relative to the compass of inertia, rotate 
around each other, not in circles, but in ellipses (or, to be more exact, in 
rotating ellipses). 




Introductory note to 1958 and 1972 


1. Preliminary remarks; history of the paper 


Godel’s “ Dialectica paper” appeared in 1958, in German, in honor 
of P. Bernays’ 70th birthday; it is reproduced, following this introduc¬ 
tory note, together with a translation into English by Stefan Bauer- 
Mengelberg and Jean van Heijenoort. The ideas in this paper date back 
at least as far as 1941, since Godel lectured at that time on his interpre¬ 
tation at Princeton and Yale. (In Godel’s Nachlass there is the text of 
a lecture, “In what sense is intuitionistic logic constructive?”, given at 
Yale University on 15 April 1941.) 

In this volume there is also a second version of Godel 1958, which 
has not previously appeared in print. This, too, has a long history; in 
the form reproduced here it dates from circa 1972. Between the first 
and the second version, Godel’s interest shifted from the mathematical 
result to its philosophical aspects; as he wrote in a letter to Bernays of 
16 May 1968, “In those days, after all, I set no particular store by the 
philosophical aspect; rather, it was chiefly the mathematical result that 
was important to me, while now it is the other way around.” a Neverthe¬ 
less, 1958 is already presented by Godel as a foundational contribution, 
not as a technical one. 

Godel never managed to express his ideas on the philosophical aspects 
of the interpretation to his own satisfaction, as is evidenced by the vi¬ 
cissitudes of the second version of the Dialectica paper (henceforth cited 
as Godel 1972). In 1965 Bernays informed Godel of the plan to publish 
an English translation (by Leo F. Boron) of 1958, again in Dialectica. 
Godel then undertook to revise his paper for that occasion; and in Jan¬ 
uary 1967 he expressed satisfaction with the result. 

In 1968, however, on rereading the philosophical introduction to the 
original paper, Godel became dissatisfied with it and rewrote it com¬ 
pletely. Equally dissatisfied with this revision, he gave up the idea of 
rewriting the introduction and decided to add a series of notes (a-n) in¬ 
stead (Godel to Bernays, 16 May and 17 December 1968). In 1970 the re¬ 
vised version, after much active prompting and help from Bernays and D. 
S. Scott, was sent to the printer. When the proof sheets arrived, Godel 
was not pleased with notes i and h, and rewrote them almost completely, 


a “Ich legte ja damals keinen besonderen Wert auf das Philosophische, sondern 
;s kam mir hauptsachlich auf das mathematische Resultat an, wahrend es jetzt 
lmgekehrt ist.” 
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but never returned the proof sheets. Apparently he continued to make 
minor revisions and corrections as late as 1972—at least there are some 
handwritten corrections on the proof sheets which concern points raised 
by Bernays in a letter to Godel of 16 March 1972. 

Godel had been in poor health since 1968, and this may have had 
much to do with the hesitations and doubts that are manifested in the 
style of the additional notes in 1972, which lack the sureness of touch of 
Godel’s earlier work (see Kreisel 1980). A less subjective reason for the 
difficulties lies in the material itself: it is quite difficult to formulate the 
philosophical gain achieved in 1958 and 1972. 

The presentation of the mathematical results also leaves much to be 
desired; the system T is given in outline only, in the main body of text; 
the description is then expanded and/or modified in the additional notes 
for 1972. Similarly, the description of interpretations of T is not at all 
clear-cut. Accordingly, after discussing the aims of Godel’s paper in Sec¬ 
tion 2 below, we devote a (long) Section 3 to the technical background 
of Godel’s system T and his main result, while Section 4 is devoted 
to interpretations of T; Section 5 describes further work that has been 
prompted by Godel 1958. 

Our commentary on these papers concentrates on 1972] references to 
1958 are explicitly noted in the text. We use Troelstra 1973 as a source 
of technical background information. 

References to Godel’s notes: We list Godel’s notes 1-11 and a-n with 
an indication of where they are referred to in our introduction: 3 (2.2), 
5 (4.1), 6 (2.1, 2.2, 4.1), 7 (3.1, 3.2), 8 (3.2), 10 (3.1); b (2.1, 2.2), c 
(2.2, 3.3), d (5.1), e (2.1), g (4.2), h (2.1, 3.3, 4.1-3), i (3.1-3), j (3.1), 
1 (3.2), m (3.1), n (3.2, 3.3, 4.3). 


2. Aims of Godel’s paper 

2.1 Godel originally devised his interpretation for technical applica¬ 
tions; specifically, the underivability of ->-iVa:(A(a:) V ->A(x)) in intu- 
itionistic predicate logic ( Kleene 1973, note 7). In correspondence, G. 
Kreisel wrote that Godel wanted to establish that intuitionistic proofs 
of existential theorems provide explicit realizations. 

However, in 1958 Godel presents his results as a contribution to a 
liberalized version of Hilbert’s program: to justify classical systems, in 
particular arithmetic, in terms of notions as intuitively clear as possible. 
Hilbert wanted to find these intuitively clear notions in the domain of 
‘finitary mathematics’. 

In Hilbert’s sense, this may be described as mathematics of a purely 
combinatorial nature, dealing with configurations of finite, discrete, con- 
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cretely representable objects that can be surveyed (grasped) in all their 
parts. Elementary-school arithmetic may be regarded as typically fini- 
tary in Hilbert’s sense: it deals with natural numbers and certain specific 
operations on them, such as addition and multiplication, which have 
purely combinatorial character. On the other hand, the general concept 
of a function from N to N is not finitary (see Godel’s note b). Even more 
abstract in character, and therefore further removed from finitary math¬ 
ematics, is the use of abstract (intuitionistic) proofs in the explanation 13 
of the intuitionistic logical operators (see the opening paragraph of 1958 
or of 1972, and notes e and h). In this explanation the meaning of 
the logical operators is given by describing proofs of logically compound 
statements in terms of the constituent statements. Two typical clauses 
are the following: 

(1) p proves AM B if p is either of the form (0,p'), where p' is a proof 
of A, or of the form (1 ,p'), where p' is a proof of B. 

(2) p proves A —► B if p is a construction (method) which, when ap¬ 
plied to any proof q of A, yields a proof p(q) of B. (‘p proves -u4’ is a 
special case: p transforms any hypothetical proof of A into a proof of an 
absurdity.) 

Thus A V ->A is not valid for this interpretation. Accepting these 
explanations for intuitionistic logic provides us with a justification and 
consistency proof for intuitionistic first-order arithmetic HA (‘Heyting’s 
arithmetic’) comparable to the technically trivial consistency proof for 
classical first-order arithmetic PA (‘Peano arithmetic’) that simply con¬ 
sists in interpreting PA in the intended (standard) model and observing 
that the logical rules preserve truth. Combining the indicated justi¬ 
fication for HA with Godel’s own ‘negative translation’ ( 1933e ) that 
reduces 0 PA to HA, one obtains a justification of PA in terms of intui¬ 
tionistic principles. As Godel observes in note 6, even if we do not have 
a sufficiently clear idea of the notion of a constructive proof, we do not 
doubt that the laws of intuitionistic logic hold for it. (Conversely, if one 
is worried about the coherence of the explanations of intuitionistic logic 
in terms of abstract proofs, but accepts the classical notion of truth, 
then HA C PA is an immediate justification of HA.) 

2.2 Godel argues that, since the finitistic methods considered are not 
sufficient to carry out Hilbert’s program, one has to admit at least 
some abstract notions in a consistency proof; the necessity of this is 


b Godel calls this Heyting’s explanation, though “Heyting-Kolmogorov” might be 
more appropriate. See our introductory notes to 1933e and 1933f'va Volume I. 

c The systems of classical and intuitionistic arithmetic considered in 1933e are not 
identical with PA and HA of the recent literature, but that is not essential. 
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shown both by the second incompleteness theorem and by our experi¬ 
ence with known consistency proofs, all of which appeal at some point 
to an abstract notion (see the second paragraph of 1958 or 1972, and 
notes 3 and c; as to the use of the term ‘finitary’, see also b). In a 
letter to Bernays, dated 25 July 1969, Godel said that the restriction to 
‘visualizable’ objects in Hilbert’s finitism was quite unnatural. 

However, Godel did not want to go as far as admitting Heyting’s ab¬ 
stract notion of constructive proof; hence he tried to replace the notion 
of constructive proof by something more definite, less abstract (that is, 
more nearly finitistic), his principal candidate being a notion of ‘com¬ 
putable functional of finite type’ which is to be accepted as sufficiently 
well understood to justify the axioms and rules of his system T, an es¬ 
sentially logic-free theory of functionals of finite type (see note 6). 

2.3 The method used by Godel consists in associating with each A in 
the language of arithmetic a formula of the form 3aNyAn{x, y ), where 
Ad is quantifier-free and x, y are strings of variables for functionals 
of finite type. This association is such that, if HA proves A, then T 
proves Ao{t,y ) for a suitable sequence of terms t (not containing y). 
For a quantifier-free A , Ad = A ; thus consistency follows if we accept 
T, since, if T is correct, one cannot derive 1 = 0. 

We are not asked to think of the reinterpretation of the logical opera¬ 
tors of A, involved in the transition to Ad, as particularly fundamental; 
the main point is that it permits a translation of A in logic-free terms 
(see our comments in 3.2 below). 

2.4 In order to gain a better understanding of what exactly Godel 
achieved, it helps to go into the technical background and content of 
Godel’s paper. We do so at length in the next two sections; in 4.4 we 
shall return to an assessment of Godel’s reduction. 


3. Godel’s main result 

This section is quite long and technically the most complicated one 
in this introduction. The best procedure for the reader is perhaps to 
alternate between Godel’s text and our explanations, starting with the 
present section. 

3.1 The basic languages and systems 

As noted earlier, Godel’s description of his system T is rather sketchy: 
it is supposed to be a quantifier-free theory of computable finite-type 
functionals, with a primitive notion of decidable (intensional) equality 
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for all finite types (note 7) and in which functionals can be introduced 
by explicit definition and recursion. Additional specifications are con¬ 
tained in note i. To explain them in definite and precise terms, we begin 
by describing a version T of T that seems to correspond well to Godel’s 
intentions in 1958 and the main text of 1972\ from this we extract a 
subsystem T 0 suggested by note i and sufficient for Godel’s principal 
result. The motivation for our choice is found in subsection 3.3. 

We shall use A, V, —>, -i, V, 3 for the logical operators (Godel has D, 
( ) for —► , V). Intuitionistic arithmetic HA (Godel’s ‘H’; see notes 10, j 
and i5) is taken to be formulated with number variables ( x , y, z, u, v, 
w), the Peano axioms for zero (0) and successor (S + ), the induction ax¬ 
iom schema (or the corresponding rule), and constants for all primitive 
recursive functions, with their defining equations as axioms.* 1 

The type structure T (on which T is based) is generated from type 
0 (natural numbers) by the rule: if cr, are types, then so is 

(<7, ti, ... ,T n ). Intuitively, (cr, t\, . .., r n ) consists of n-place functions 
which, applied to arguments of types t\, ... ,t„, give a value of type cr. 
These are called functionals in general, since, for Ti ^ 0, the arguments 
are themselves functions. 

For Godel’s type (a, r) we shall write (r)<r or r —> a. Then Schonfin- 
kel’s device of reducing n-place functions to unary functions by means 
of the isomorphism X YxZ = (X Y ) Z permits us to think of the type (a, 
Ti,T n ) as an abbreviation for (n)... (r n )cr. The restriction to unary 
functionals is customary in the literature and technically convenient. We 
also use 1 for the type (0)0 and 2 for (1)0. 

The language C of T contains variables for each type a of T(x a , y a , 
z a , u a , v a , w a , possibly subscripted) and individual constants for cer¬ 
tain types, to be specified below. The informal interpretation is that the 
variables of type a, for <r/0, range over the computable functionals of 
that type; this will be analyzed in more detail in Section 4 below. 

For each type cr e T there is a primitive binary predicate constant =„ 
in C for equality at type a. 

The letters t, and s (possibly with sub- or superscripts) will be used 
for terms. l t is of type <r’ can be written as t £ cr or as t a ; where there 
is no ambiguity, type superscripts are suppressed. If t e (<t)t, t' e a , 
then App ffT (i, £') (“i applied to /'”) is a term of type r, for which we 
shall simply write t(1/) or even tt'. The notation ii £2 ■■ - t n is 
short for ti^Xfa) • • • (tn), which corresponds to Godel’s ii(<2, • •., t n ). 


d As observed by Godel in note m, one can dispense with negation as a primitive, 
defining -<A as A —» 1 = 0. In fact, in HA one can also dispense with disjunction, 
defining it as A V B := 3a:[(a: = 0 —> /\) A (x / 0 —* £?)]. In note i5 Godel observes 
that, because of the presence of the predecessor function, one of the Peano axioms is 
in fact redundant. 
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When suppressing type specifications for compound expressions, we shall 
always tacitly assume types to be ‘fitting’, that is, in a term tt', t must 
be of type (cr)r, t' of type <r, for certain a, r e T. 

Prime formulas of £ are term equations: t a =„ s a , or simply i — s 
without type indications; compound formulas are constructed by means 
of the propositional operators A, V, —->A is an abbreviation for A —> 
5+0 = 0. 

The logical basis of T is the intuitionistic (many-sorted) propositional 
calculus with equality, where equality is assumed to be decidable for each 
type a, that is, t = a sVt s. Equivalently, one might have taken clas¬ 
sical propositional logic as the basis, since the decidability of the prime 
formulas entails the decidability of all compound formulas by intuition¬ 
istic logic. Godel adds a rule of substitution: if we have derived A(x cr ), 
we may infer A(t a ); but this rule is not needed if, instead, all schemas 
formulated with free variables are replaced by schemas formulated for 
arbitrary terms. 

Before continuing our description of T, we introduce some further ab¬ 
breviations, which will enable us to deal with finite strings of variables 
and terms. We shall use u, v, w, x, y, z, U,...,Z for finite sequences 
of variables, and t, s, S for finite sequences of terms. 

Let x = (xi,...,x„), y = (t/i,...,y m ). Then V®, 3® abbreviate 
V®i... V®„, 3®i... 3x n respectively; \/xy stands for V®Vy; etc. If t = 
(ti,..., t n ) and 8 = (s i,..., s m ), then ts in a formula A(ts) stands for a 
finite sequence (tiSi... s m ,, t n s i... s m ); t = 8 is a sequence of equa¬ 
tions ti = Si,... ,t n — s n (n has to be equal to m). 

We often write t[®] or t[®i,...,®„] in order to refer to the (possi¬ 
bly empty) sets of free occurrences of the variables x, or ®i,..., ® n , in 
t. Then t[ti,. denotes the result of simultaneously substituting 

for ®i,..., x n , respectively. 

The non-logical axioms and rules for T contain the (Peano) axioms 
for zero (0) and successor ( S + ) and the induction rule 

IND-R >1(0), j4(s°) - ^(S'+x 0 ) => >1(® 0 ). 

Besides 0 and S + , T should also contain (i) constants for functionals 
introduced by explicit definition (combinatorial completeness), that is, 
whenever t is a term built by application from variables xi,.... x n and 
constants already introduced, there is a constant <j> such that 

(j)X\ ...x n =t, 

and (ii) constants for functionals defined by primitive recursion. 

(i) can be guaranteed by having A-abstraction in T, or, as is often 
technically more convenient, by having combinator constants A' CT T of 
type (<r)(r)cr and S PtlTtT of type ((p)(o r )r)((p)a)(p)(r such that 
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(1) K„ T xy — x, S Pt(TiT xyz = xz(yz); 

it is well known that these permit us to define a term A x.t[x], for each 
term t, such that (Ax.i[x])(i') = /,[<'] (see Troelstra 1973, 1.6.8). 

As to (ii), it suffices to add for each a a ‘recursor’ R a of type 
(<7)((cr)(0)cr)(0)<7 such that 

(2) R„xyO = x, R tr xy(S + z) = y{R a xyz)z. 

From the R a one can define (sequences of) constants for simultaneous 
recursion (see Troelstra 1973, 1.7). 

Following Godel’s note i4, let To (with language Co) be the subsystem 
of T obtained by restricting prime formulas to numerical equalities (that 
is, now =o is the only primitive notion of equality). Of course, the 
equality axioms (1) and (2) for higher types cannot be expressed in 
T 0 as they stand; instead, one assumes the corresponding replacement 
schemas; for an arbitrary term s[x] of type 0 we have 

, s s[Ktt'] = s[f], s[Stt’t"] = s[tt"(t't")], 

= «[t], s[Rtt'(S + t")] = s[t'{Rtt't")t"\ 

The decidability of equality for type 0 can be proved; for higher types 
the decidability assumption is dropped. 

This completes our description of one precise version T of Godel’s T 
and a significant subsystem To of T. 


3.2 Godel’s translation and interpretation 

Godel’s translation D (denoted by ' in Godel’s paper) assigns to each 
A in the language of HA a formula A D of the form 3xVyAjj(x,y), where 
Ad e Co (that is, Ad is quantifier-free). Godel’s principal result may 
then be stated as follows: 

(I) If HA b A, then for some sequence of terms t, T F A D ( t, y) 
(y not in t). 

In fact, already To F Ao(t, y). If T (T 0 ) is regarded as embodying 
evident principles, this can be regarded as a consistency proof for HA. 
To obtain a similar result for PA, one first applies the translation from 
Godel 1933e. 

It is not too easy to explain the intuitive content of A D in a few words; 
so we shall first formally define the translation and afterwards return to 
this question. A D is defined for arbitrary formulas by induction on their 
logical complexity. 
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(=) D For A prime, x, y are empty sequences and A D = Ad = A. 

For all other clauses, let A D = 3xVyAo(x,y), B D = 3uWvB D (u,v). 
Then 

(A°) {A A B) d : = 3xu\/vy(A D A B D ), 

(' M d ) (A V B) d : = 3 z°xuVvy[(z — 0 A Ad) V (z — 1 A Bd )], 

(V D ) (VzA(z)) d : = 3xVzyA D (xz,y), 

( 3 d ) (3 zA(z)) d : = 3zxVyA D (z,x,y), 

(-» D ) {A -+ B) D : = 3 UY1xv(A D (x, Yxv) -* B D ( Ux, v)). 

In the above definition certain obvious requirements on the variables are 
to be met; for example, in (A d ) the sequences x, u, v, y must all be 
kept disjoint, if necessary by renaming variables. As to the clause (V D ), 
see Godel’s note l. e 

Suppose PA I- A and let A! be a prenex form of ->A; with the help of 
the ‘negative translation’ ( Godel 1933e) one easily sees that HA I— A'\ 
then by (I) there are terms t such that HA h -iA' d (x, tx). This is 
Kreisel’s no-counterexample interpretation* for PA ( Kreisel 1951, 1952 ; 
see also Troelstra 1973 , 3.5.3). 

In 1958 and 1972 the definition of A D is given only for formulas A 
of the language of HA, that is, z is of type 0 in the clauses (V D ), (3 d ). 
But, formally, these clauses make just as good sense when extended to 
variables 2 of any finite type. We write C*, £q for the languages of T, 
T 0 extended with quantifiers for all finite types. A D is thus defined for 
all A in C*. 

The following may serve to motivate the definition of A D . It is ob¬ 
vious how we can constructively verify statements 3xVyA(x, y), for A 
quantifier-free, namely by providing an explicit t such that A(t, y) holds 
(the constructive meaning of a quantifier-free statement is taken to be 
immediate). If we wish to assign a similar interpretation to arbitrary A, 
we should look for a statement A D = 3x\/yAo(x, y) (classically) equiv- 


e The clause (V D ) can also be formulated as 

{A V B) d := 3z°xuVvy[(z = 0 - A D ) A (z ± 0 —> fl D )], 
which corresponds precisely with the definition of V given in footnote d. 

f This treatment of the no-counterexample interpretation (n.c.i.) introduces type-2 
functionals via the higher types. Such a detour can be avoided by a direct treatment, 
as shown by Tait. An adequate system of type-2 functionals is given in Tait 1965a. A 
nice treatment is in Schwichtenberg 1977. The no-counterexample interpretation, and 
hence also Godel’s interpretation, may be viewed as an extension and generalization 
of Herbrand’s theorem to arithmetic (see e.g. Girard 1982)-, in this connection it is 
perhaps interesting that Godel already realized in the early forties (in 1943 or before) 
that Herbrand’s proof of his theorem was in need of correction. 
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alent to A. Godel’s definition of A D accomplishes this in such a way that 
explicit realization of existential quantifiers and choices of disjuncts for 
disjunctions are encoded in the verifying x, as may be seen by inspecting 
the clauses (V D ) and (3 d ). As to the required equivalence A <-» A D , note 
that A A D for A prime, and intuitionistically (A A B) D <-> A D A B D , 
{A V B) D ^ A D V B D and {3zA{z))° <-+ 3 zA(z) D -, (VzA) D <-> VzA D 
holds if we accept 

AC Vx3yB(x, y) -» 3 YVxB(x, Yx). 

Finally, clause (—>) D can be motivated® by the following sequence of 
steps ( Spector 1962): 

[3xiyA D (x, y) -> 3uivB D (u ,»)] 

[Vx(\/yA D (x, y) ->• 3uVvB D (u ,«))] 

[Vx3u(\/yA D (x, y) -> VvB D (u, w))] 

$/x3uVv(yyA D (x, y) -» B D (u ,»))] 

\ix3vNv3y{A D {x, y) -► B D {u ,»))] ^ 

3UYVxv(A d (x, Yxv) — B d (Ux, v )). 

All steps (i)-(iv) are classically justified, in fact (i) and (iii) are intuition- 
istic; (v) consists in a double application of AC. A further discussion of 
the assumptions involved in (ii) and (iv) will be taken up in 3.3 below. 

It should be pointed out that Godel does not claim any fundamen¬ 
tal significance for D as such; his principal interest lies in achieving a 
conceptual reduction of intuitionistic arithmetic to ‘nearly finitistic’ no¬ 
tions; that the translation used is perhaps very tortuous is irrelevant to 
this aim. 

The verification of the main result (I) proceeds by induction on the 
length of deductions in HA, that is, by showing that, for suitable t, 

(*) Th A D (t,y) 

holds for each axiom A of HA and is preserved by the rules of inference. 
Details are given in Troelstra 1973. There are only two delicate points 
involved. One comes when verifying (*) for instances of induction in 
HA. It may be seen that what is needed in T for this purpose is the 
generalization of the rule of induction mentioned by Godel in note il. 


« Godel’s 


motivation, in the 


presented differently. 
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This rule is in fact derivable in T, as asserted by Godel (i3); for a proof, 
see Troelstra 1973, 1.7. 

A second point occurs in verifying (*) for the seemingly innocuous 
axioms of the form A —> A A A. Here we need not only the decidability 
of prime formulas but the stronger fact that there exist characteristic 
terms for them, that is to say that for each prime formula A(x) with 
free variables among the x there is a term Ia such that tyi x = 0 <-> A{x). 
To see how the need for La arises, let us consider the interpretation of 
A —> A A A. Let A D = 3xVyAp; then (A-*iA A) D becomes 

pxVyAp -► 3 afx"Vy'i/'{A D (a!, \J) A A D (a/', i/'))] D = 

3yx'x"\/xy'i/'(A D (x, yxyy") -»■ A D {x'x, y) A A D (x"x, y")). 

Now, if tAxy — 0 <-> Ad(x, y), we can take 

s'EE Ax. X, x" = Ax. X, = Vdtl. 

The definition of y is an instance of definition-by-cases, which is readily 
justified with the help of the recursor constants. Note that the choice of 
interpretation here is in some respects arbitrary; we might equally well 
have taken 

yxy'yf' = j {' if t A xy” ± 0, yxyy" = y otherwise. 

This is to say, the straightforward construction of terms t such that (*) 
holds, by induction on the length of derivations in HA, is not canonical 
at this step. 

If A is arithmetical, Ad will be quantifier-free with equality of type 
0 only (that is, Ad belongs to Co), and the existence of t AD satisfying 
tA D = 0 «-+ Ad is in fact provable in T 0 . The rule for disjunctive 
definition as stated in Godel’s note i2 is in fact equivalent to the existence 
of characteristic functions plus the following more restricted rule for 
definition-by-cases of a functional /, 

fx—t\ if t = 0, fx --- otherwise 

(all free variables of fi, f 2 , t contained in x). 

Next we wish to discuss T as it actually figures in Godel’s text. We 
believe that T as described in 3.1 corresponds pretty well to Godel’s in¬ 
tentions. Certainly in 1958 Godel wanted T to have a decidable primitive 
notion of equality for each finite type (note 7), since the decidability of 
prime formulas together with intuitionistic logic justifies the decidabil¬ 
ity of all formulas of T, and the formulas may therefore be regarded as 
essentially ‘logic-free’. (No abstract interpretation of the logical opera- 
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tions is involved, as there is, for example, in the proof-interpretation of 

-0 

It is doubtful whether in 1958 Godel had already realized the need 
for characteristic functions in connection with the axiom A—*A/\A. In 
a letter of 1970 J. Diller explicitly drew Godel’s attention to the role of 
characteristic functions. Godel reacted in a letter to Bernays (14 July 
1970) as follows: “I do not understand what it means to say that in 
my proof of the formula pDpApa passage (which is not possible) to 
the characteristic term of a formula is required. What is required is the 
decidability of intensional equations between functions.” 11 

Godel probably regarded the existence of (computable) equality func¬ 
tionals as a concomitant of the decidability of equality; from the passage 
just quoted one cannot tell whether Godel realized that in general (that 
is, at higher types) the decidability of equality does not entail, axiomat- 
ically, the presence of equality functionals at higher types. In any case 
the proof sheets of 1972 contain an earlier, crossed-out version of foot¬ 
note i2 which reads: 

2. The principle of disjunctive definition, added to Axiom 5 in the 
present version of the paper, is the following: A function / may be 
defined by stipulating 

A D f{x) =t u hAd f{ x ) = t 2 , 

where fi, t 2 are terms and A is a formula, both containing only 
previously defined functions and no variables except those of the 
sequence x. This principle is needed for the proofs that the axioms 
1 and 4 of H and the deduction rule 6 of If hold in the interpre¬ 
tation defined below. It can be derived if equality functions with 
the axioms G{f,g) = 0 = / = g are introduced as primitive terms 
at all types. At any rate either disjunctive definition, or the axiom 
for equality functions, must be added to the axioms mentioned in 
the first edition. 

Afterwards Godel must have realized that A D for arithmetical A re¬ 
quired only equations between numerical terms, and he accordingly 
rewrote note i2 to the version we find in the text. 1 Diller and Nahm, on 


h “Ich verstehe nicht was es heissen soil, dass in meinem Beweis der Formel 
p 3 p A p ein (nicht moglicher) Ubergang zum characteristischen Term einer Formel 
notig sei. Was notig ist, ist die Entscheidbarkeit von intensionalen Gleichungen zwi- 
schen Funktionen.” 

‘Thus, in the words of G. Kreisel, the whole issue of decidability of equality and 
characteristic functions for equality at higher types turned out to be a “red herring”, 
at least for Godel’s purposes. 
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the other hand, had attempted to extend Godel’s result to higher-type 
arithmetic formulated in the language £*, that is, £ with quantifiers 
Vx", 3x a added, where equality functionals at all types (or equivalently, 
characteristic functions at all types) are necessary. In 1974 they gave a 
variant A of the translation D which also achieved reduction to 3V-form, 
for which an interpretation theorem similar to (I) could be proved and 
for which no appeal to decidability of prime formulas and characteristic 
functions was necessary (more about this in Section 5.4). 

Godel’s note i4 shows that he realized that one can dispense with all 
references to equality at higher types for Theorem I, provided one refor¬ 
mulates the defining axioms for the functional constants as replacement 
schemas for terms, as we did (3.1(3)) in formulating T 0 . Clearly, in note 
i4 Godel had something like our T 0 in mind. He also observes that one 
can in fact dispense with the propositional operators, reducing every¬ 
thing to a term-equation calculus; in the context of primitive recursive 
arithmetic PRA this fact was first proved by Goodstein (1945; see also 
his 1957). Hilbert and Bernays 1934 (Chapter 7) showed that proposi¬ 
tional combinations of term equations can be replaced by a single term 
equation, but not that the addition of the propositional operators with 
their usual rules is in fact a conservative addition to the term-equation 
calculus. 

In note n, Godel refers to the system S 2 of Spector 1962 minus 
axiom F, corresponding to the system WE-HA“ as described below 
in 3.3. However, the extensionality rule of Spector’s system does not 
seem to fit in with Godel’s decidedly intensional view of equality; and in 
fact Godel’s own paraphrase of E 2 -{F}, when read in combination with 
note i4, rather points to the system HAq (‘quantified To’; see 3.4 
below). 

We do not feel quite certain of the correct reading of note 8. One pos¬ 
sible interpretation is the following: since, for Gddel, T has functionals 
with n arguments as a primitive notion, one can introduce a constant 
functional P of higher type by (P(x i,..., x n ))(yi ,..., y m ) = t, where 
all free variables of t are contained in {aq,..., x n , yi, ..., y m }; with the 
notation Aa; for simultaneous abstraction for a sequence of variables one 
might write P = Xx(Xy.t). So P is of type ((t,ti, ... ,r m ),ai,... ,cr„) 
(Godel’s notation), where a* is the type of Xi, tj of yj, and r of t. 
This has no analogue in the usual formulation of PRA, since there only 
functions with values of type 0 are introduced. This formal difference 
disappears if we apply the Schonfinkel method for reducing everything 
to unary functions, both to PRA and to T. On this reading, Godel’s 
note refers only to a fairly superficial formal difference between PRA 
and T. 
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3.3 Extensions to finite type arithmetic; constructive content of D 


It is quite illuminating to see what happens to Godel’s result (I) 
if one tries to extend it from HA to various systems of intuitionistic 
higher-type arithmetic. Let £ ( * and C* be the languages of T 0 and T, 
respectively, extended with higher-type quantifiers. Let HAq be the 
extension of To to £g with intuitionistic predicate logic; HA“ is the 
corresponding extension of T to £*, but without assuming decidability 
of equality at higher types; I-HA" is HA W with characteristic functions 
I E a for equality at type a, a e T, added. E\ is certainly not defin- 
I able in HA", since all type-2 functionals of HA" are continuous, while 
Xy 1 .E(Xx° .0)y is not continuous in y 1 . 

An extensional variant E-HA 1 " in the language £q is obtained by 
defining equality at higher types as extensional equality, assuming the 
usual equality axioms for this defined equality at all types. A weakly 
extensional variant WE-HA" has, instead of the substitution schema 
f i = fg A A\t{\ —> A[f 2 ], the weaker rule 

tiXi ...x n = t 2 xi ...x n , A(ti) => A(t 2 ), 


where xi, ..., x n is a string of variables, not occurring free in t\,t 2 , such 
that t\X\... x n , t 2 x i... x n are of type 0. Schematically 


HA C HAq 


C I-HA" 
C E-HA" 


C WE-HA" C E-HA 1 ". 


As shown in Rath 1978 , HA" is in fact conservative over HAq . 

If H is one of the systems HA", HA 1 ", I-HA 1 ", WE-HA 1 ", E-HA 1 ", 
qf-H denotes the fragment without quantifiers (“quantifier-free”), so 
qf-HAg = T 0 , and T C qf-I-HA 1 ". 

For H any one of I-HA 1 ", WE-HA 1 " or HAg, we can show straight¬ 
forwardly the following extensions of Godel’s (I): 


(II) H b A(x) => qf-H b Aj)(tx, y, x) for a suitable sequence t of 
closed terms; 


(III) H b A => H b A° (soundness, as a corollary to II). 


Soundness must necessarily fail for HA", as shown by Howard ( Troel- 
stra 1973, 3.5.6), since soundness for systems in the language C* 
would entail = z 1 V -ly 1 = z 1 ), i.e., a weak form of decid- 
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ability, which cannot be proved in HA"; and soundness also fails for 
E-HA", since no functional of E-HA" can satisfy the D translation of 
\/z^°x 1 y 1 (x — y —* zx — zy) (Howard, in Troelstra 1973, Appendix). 
Of course, there is an indirect method for interpreting E-HA": first 
interpret this theory in HAq by hereditarily restricting all quantifiers in 
HAq to elements of type a which respect extensional equality, and then 
apply the Dialectica interpretation; this is the road taken in Luckhardt 
1973. 

WE-HA" as an intermediate possibility is not very attractive: the 
deduction theorem does not hold for this theory. (However, it does hold 
for qf-WE-HA"; in this respect the comments in Troelstra 1973, 1.6.12, 
are misleading and partly wrong. See also Godel’s note n.) 

Let us now return to a discussion of the equivalence A <-+ A D . If the 
sole aim of the translation were to associate with each A a classically 
equivalent (modulo AC) A! of the form 3 xVyA"(x,y), any standard 
recipe for rewriting A in prenex normal form, followed by a number of 
applications of AC so as to bring all 3 in front (“Skolemization”), would 
do the job. However, A D is designed so as to keep as close as possible 
to an intuitionistic reading of A, by minimizing the non-intuitionistic 
steps in the translation from A to A D . The only clause which needs 
inspection in this connection is (—i > D ), since the other clauses involve 
only AC and transitions valid by intuitionistic logic. As noted already 
in 3.2, the crucial steps in the definition of (A —> B)° are (ii) and (iv). 
Step (ii) is an instance of the schema 

IP' (V® Ci 3yC 2 ) -» 3y(\/xC 1 -» C 2 ) 

(Ci, C*2 quantifier-free, y not free in C\, x not free in C2). The notation 
IP' derives from “Independence of Premiss”, since, assuming VxC 1 —* 
3yC 2 , IP' requires that we can a priori indicate y, independently of the 
truth of V®Ci; an intuitionistic reading of V®Ci —> 3yC 2 requires only 
that, once a proof of V®Ci is given, we can find a y (possibly depending 
on the given proof). 

Step (iv) can be justified on the basis of the following generalization 
of “Markov’s principle”: 

M' -1 'ivC —> 3v~iC (C quantifier-free) 

((iv) is justified if Bd(u, v) holds; if not, apply M' with A D (x, y) for C). 
Markov’s principle as accepted by the Russian constructivist school is 
the special case where v is of type 0 and C primitive recursive. 

For the interpretation in terms of abstract proofs, IP' and M' are 
usually not accepted as valid, and AC only in special cases (in particular 
where x is of type 0). However, nothing in the intuitionist point of view 
prevents us from giving a more special interpretation to the logical oper- 
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ators in the context of a given language, provided that interpretation 
is itself meaningful and intelligible from an intuitionistic point of view 
(which is certainly the case for D ). For the same reasons, however, one 
should question Godel’s remark (at the end of note h) that his inter¬ 
pretation is more constructive than the proof-interpretation because it 
validates Markov’s schema. In fact, Markov’s schema is false for some 
perfectly coherent intuitionistic theories such as the theory of lawless se¬ 
quences (see Troelstra 1977, Chapter II), while Godel himself, at the end 
of note c, regards choice sequences as coming close to being finitistic. 

Not only does the Dialectica interpretation validate Markov’s princi¬ 
ple, but in fact any instance of M', IP' and AC. As a result, we obtain 
the following strengthening of (II) and (III), for the same systems H as 
before: 

(IV) H + IP' + M' + AC h A <-> A°, and 

H + IP' + M' + AC I- A => qf-H h An(t,y) for suitable t. 

The first half of (IV) also holds for H = E-HA" or HA", and is im¬ 
plicit in Kreisel 1959 (2.11,3.51). The second half of (IV) was explicitly 
stated in Yasugi 1963 for WE-HA". 


4. Models for T 

For the sake of definiteness, we shall here identify Godel’s T with our 
T of 3.1. Three models for T are mentioned in Godel’s paper: the com¬ 
putable functionals, the hereditarily recursive operations, and the term 
model. We shall now briefly discuss each of them in turn. 

4.1 The computable functionals of finite type 

These are informally described by Godel as follows: the computable 
functions of type 0 are the natural numbers; a computable function of 
type ( ct)t is a well-defined mathematical procedure which, applied to a 
computable function of type <r, yields a computable function of type r. 
Here “well-defined mathematical procedure” must be taken as an un¬ 
derstood primitive notion, and Godel stresses (note 6) the parallel with 
the role of “constructive proof” in Heyting’s explanations of the logical 
operators. 

The precise wording of the description of this notion together with 
the explanatory note 6 caused Godel a good deal of trouble. In a letter 
to Bernays, dated 17 December 1968, Godel, referring to note 1 of 1958 
(corresponding to note 6 of 1972), observes that “it seems to be very 
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difficult (or impossible) to make this more precise and yet to maintain 
it to its full extent”. 

In the description of computable function, the proof sheets have, in¬ 
stead of “this general fact is constructively evident ... without any fur¬ 
ther explanation [6]”, the following crossed-out passage: “this general 
fact is intuitionistically demonstrable. This definition of ‘computable 
function of type (to, fi,..., tfc)’ must be accepted as having a clear mean¬ 
ing without any further explanation [6] provided one already has clear 
ideas of the meanings of the phrases ‘computable function of type £»’ for 
i — 0,1,..., k.” (Godel’s [6] is note 6 in 1972 as reproduced here.) 

As Bernays observed in a letter to Godel of 12 December 1970, this 
makes it appear as if the notion of computable function depends on 
the general notion of intuitionistic proof, contrary to Godel’s intentions. 
Consequently, Godel replaced “intuitionistically demonstrable” by “con¬ 
structively evident”, but he remained dissatisfied as is shown by his let¬ 
ters to Bernays (that of 26 December 1972, in particular). His note h, 
discussed below, may be seen as an attempt to interpret T with the help 
of a narrower concept of proof that is more obviously independent of the 
general notion of intuitionistic proof. 

As Godel realized, it was not possible to avoid a certain “impred- 
icativity” in the notion of a finite-type function. “Impredicative” here 
refers to the fact that, e.g., functionals of type (0)0 could be defined via 
functionals of much more complex types—just as an intuitionistic proof 
of a statement may perhaps refer to proofs of more complex statements. 
It is this fact which makes it difficult to formulate the epistemological 
gain obtained in replacing the general concept of intuitionistic proof by 
“computable function of finite type” or by one of the interpretations 
discussed in 4.2 and 4.3 below. 

4.2 The hereditarily recursive operations HRO 

For each a e T, define V a , the set of (Godel numbers of) hereditarily 
recursive operations as follows 

V 0 N, V( a ) T := {x:\fye V a 3z e V T ({x}(y) ~ z)}. 

Here {■} are the “Kleene-brackets” for partial recursive function appli¬ 
cation; i.e., with Kleene’s T-predicate and result-extracting function U, 

(x}(t/) ~ z <-» 3u(T(x, y, u ) A U{u) = z). 

Equality at each type a is interpreted as equality of Godel numbers. 
The reader should note that with increasing complexity of the type a 
the arithmetical complexity (level in the arithmetical hierarchy) of the 
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predicates V a increases without bound, and that x eV a as a predicate of 
x is certainly not recursive for <r/0. Thus the only “reductive gain” of 
the interpretation of T in terms of HRO is that arbitrary arithmetical 
predicates are explained in terms of the more special predicates x eV a . 
For details, see Troelstra 1973, 2.4.8. Clearly it is this model to which 
Godel refers in notes g and h as based on Turing’s notion of a com¬ 
putable function. 

4.3 The term model 

Many variants of this can be given; we chobse a simple version that 
is easy to describe and is suitable for illustrating Godel’s intentions. 

Redexes are terms of one of the following forms: Ktt', Stft ", Rtf 0, 
Rtt'(S + t"), where t, f, t" do not contain subterms that are redexes; 
these convert respectively to t, tt"(t't"), t, t'(Rtt't")t". It can be shown 
that each closed term t can be reduced to a unique redex-free term (the 
normal form of t), by successively converting redexes occurring as sub¬ 
terms. We use NF for the set of closed terms in normal form. 

The most straightforward method for proving this is by means of so- 
called j ‘computability’ predicates ( Tait 1967). Let t, t' range over closed 
terms, and put 

Comp 0 (t) := t is of type 0 and reduces to normal form, 
Comp( CT ) T (<) := t reduces to normal form 

and VP e Comp^tt' e Comp r ). 

The proof then proceeds by noting that all applicative combinations 
of computable terms are computable, and that all constants are com¬ 
putable; afterwards one can prove the normal form to be unique. 

For the model of T one takes the t e NF of type a to be the objects of 
type a. Equality is interpreted as literal identity (hence it is recursive!) 
and application App (s,t) as the (unique) normal form of s(t). 

In view of Godel’s choice of terminology (“reductive proof”) in note 
nl, it is tempting to think he had something like a term model, defined 
via reductions, in mind. But there is no conclusive evidence for this. 
Though we may assume that Tait 1967 was known to Godel, he does 
not refer to it in the paper, nor does he ever refer to Tait’s work in his 
letters to Bernays. Nevertheless, the model NF may be used in an effort 
to understand Godel’s intentions in note h—at the same time revealing 
problematic aspects. 


-iThe terminology ‘hereditarily normalizable’, instead of ‘computable’, would have 
been more appropriate. 
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So let us attempt to interpret note h in terms of NF, taking T 0 for 
T. We may then think of To as an equation calculus of finite-type func¬ 
tionals, and take Godel’s T' to be To interpreted in NF. ‘Reductively 
provable’ for an equation between closed terms t = s would then mean 
that the equation can be verified by reducing t, s to their normal forms 
t N , s N and finding that t N = s N . This is indeed decidable, and the 
proof-procedure is defined in advance (see note hi,2). 

Of course, the proofs of T 0 are not reductive proofs as such; but ac¬ 
cording to Godel, it should be possible to justify all of them on the basis 
of the notion of reductive proof alone (note h3: “no other concept of 
proof ... occurs in The simplest way to interpret this claim is to 

verify that NF is indeed a model for To (see Troelstra 1973, 2.5). This 
can be done in a fairly straightforward manner, though it is not entirely 
trivial; the obvious metamathematical argument relies on induction plus 
the existence of a unique normal form for each closed term. In fact, we 
have for equations between closed terms t and s 


HAq I -t = s 


(■&t = s true in NF <&t = s reductively provable, by definition). Here 

(1) holds by the soundness of Godel’s interpretation ((II) in Section 3.3), 

(2) by the fact that NF is a model for T 0 . 

One might also think of a stricter interpretation of Godel’s claim, 
namely, that it ought to be possible to justify all proofs in To by a 
method that would explicitly transform any proof of a closed term equa¬ 
tion in To into a reductive proof. (For proofs with free variables in the 
conclusion, we ought then to require that this be possible for each sub¬ 
stitution of closed terms for the free variables of the conclusion.) 

A step in such a transformation could be the replacement of an ap¬ 
plication of the induction rule A(0), A(n) —> A(S + n) => A(m) for 
a numeral fn, by m applications of modus ponens to proofs of A(0), 
A(0) —> A( 1),..., A(m — 1) —> A(m) (see Godel’s observation on induc¬ 
tion in note n). 

Though it is possible to carry through this stricter interpretation, it 
is certainly not trivial. 


4.4 Assessment of the reduction achieved by Godel 


We now return to the issues left hanging in 2.4. The “impredicativity” 
of T has already been mentioned. As will have become clear from 4.2 
and 4.3, narrowing down the intended interpretation does not remove 
this feature: we may make the description of the intended model of T 
very definite and concrete, but the “impredicativity” then pops up in 
the arguments needed to show that the interpretations for T are correct. 
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Not only the V„, but also the arithmetized version of the predicates 
Comp^, run through all stages of the arithmetical hierarchy as the com¬ 
plexity of a increases. Godel’s aim was to replace the abstract intuition- 
istic logical notions by a notion of functional, as concrete as possible; he 
succeeded in fact in eliminating the logic except for the logic hidden in 
the precise definition of the intended class of functionals. (In 4.1, we got 
rid of the logic by accepting “computable functional” as a primitive; in 
4.2 and 4.3 the remaining logic resides in the predicates x eV„, respec¬ 
tively x e Comp^.) 

If we look at the generalization of Godel’s result, for example to HAq , 
we see that the logic is “absorbed”, under the interpretation, by the no¬ 
tion of higher-type functional. In short, there is some reductive gain 
though it is not clear-cut; we think it falls short of Godel’s aims. k 


5. Later research flowing from Godel 1958 

In this section we intend to present a brief survey of research more or 
less directly inspired by Godel’s paper. 1 As principal themes we distin¬ 
guish 

(1) extensions of Godel’s main result to other systems, 

(2) investigations of the functionals needed for the interpretation, 

(3) metamathematical applications, 

(4) the study of related interpretations. 

To each of these topics we devote a subsection. 

5.1 Extensions of Godel’s main result 

Already in Kreisel 1959 it was observed that Godel’s result is easily 
extended to intuitionistic arithmetic with transfinite induction. 

Spector (1962) extended Godel’s result to analysis formalized with 
function variables by adding to T a new definition principle, the schema 


k We should mention, however, that Godel, as late as 1974, expressed himself 
in the following terms, writing about 1958: “...the most direct way of arriving at 
an intuitionistic interpretation of T ... does not pass through Heyting’s logic, or 
the general intuitionistic concept of proof or implication, but rather through much 
narrower (and in principle decidable) concepts of ‘provable’ and ‘implies’. Thus the 
implicit use of ‘implication’ and ‘demonstrability’ in the definition of ‘computable 
function of finite type’ does not give rise to any circularity” (from a letter, or draft 
of a letter, to a Mr. Sawyer, then a graduate student at the University of Pittsburgh; 
we do not know whether the letter was actually sent or not). 

’Some papers connected with the Dialectica interpretation and Godel’s T not 
explicitly reviewed below are: Kreisel 1959a, 1959b, Grzegorczyk 1964, Diller and 
Schutte 1971, Vesley 1972, Schwichtenberg 1973, 1975, 1979, Goodman 1976 and 
Scott 1978. 
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of bar recursion at all finite types. Bar recursion (BR) is closely related 
to the axiom schema BI of bar induction at all finite types, and BR 
can in fact be justified with the help of BI. Brouwer’s “bar theorem” is 
equivalent to BI 0 , bar induction of type 0. Godel, in note d, refers to 
Spector’s work, but does not distinguish clearly between bar recursion 
and bar induction. The justification of BI 0 mentioned by Godel refers 
to the method of elimination of choice sequences," 1 which reduces BI 0 
to the theory of a single generalized inductive definition. 

Spector’s work was later refined by Howard (1968) and Luckhardt 
(1973). Howard (1972) also extended Godel’s result to a theory of 
abstract constructive ordinals (again a theory of a single generalized 
inductive definition, namely an abstract version of Kleene’s recursive 
ordinals), thereby realizing a possibility suggested by Godel at the end 
of note d. 

Quite recently Friedrich (1984, 1985) has carried the interpretation 
through for analysis extended with a game quantifier. 

Girard (1971, 1972) was the first to treat classical analysis and finite 
type theory formulated with set (predicate) variables of finite type and 
full comprehension. Girard had to invent several new technical devices, 
such as the introduction of a type structure with variable types and the 
use of ‘reducibility candidates’, in establishing normalization for his sys¬ 
tem of functionals. The latter idea led to direct proofs of normalization 
and cut-elimination for analysis and the theory of types (Girard 1972, 
Prawitz 1971, Martin-Lof 1971). 

Maafi (1976) gave a treatment of predicative analysis. Koletsos 
(1985) extended Godel’s interpretation to Girard’s /?-logic. 

5.2 Investigations of extensions of Godel’s T 

Godel’s interpretation provides a consistency proof for arithmetic 
modulo the assumption that closed terms of type zero have a unique 
numerical value, which precludes a proof of 0 = 1. If one does not want 
to rely on the insight that the computable functionals are a model of T, 
one can try to prove the assumption by analyzing the computation of 
terms of T. 

Thus many investigations have been devoted to showing that terms of 
T and some of its extensions can always be reduced to normal form. In 
particular, for closed terms of type 0 this entails that they can be shown 
to be equal to a numeral, that is, all closed terms of type 0 can be evalu¬ 
ated. 


m The result as stated in Godel’s source, Kreisel 1965, is not quite correct. See 
Kreisel and Troelstra 1970 for a corrected version. 



238 


Note to 1958 and 1972 


As to the methods for proving normalization, they are principally of 
two kinds: (a) by defining suitable computability predicates ( Dragalin 
1968, Tail 1967, 1971, Girard 1971, 1972, Luckhardt 1973, Vogel 1977) 
and (b) by ordinal assignments to terms ( Hinata 1967, Diller 1968 , 11 
Howard 1970, 1980, 1981, 1981a). The method of assigning ordinals 
less than c (l to terms of T can be used as another route to Gentzen’s 
theorem that the consistency of PA can be established in primitive recur¬ 
sive arithmetic plus quantifier-free eo-induction. In Tait 1965 Gentzen’s 
result is obtained via an assignment of infinite terms to functionals of 
T; the infinite terms of type zero are shown, by means of quantifier-free 
eo-induction, to have a unique numerical value. Infinite terms are also 
used in Howard 1972. 

Sanchis (1967) and Diller (1968) establish normalization of the type- 
zero terms of T by means of bar induction (Diller also gave an ordinal 
assignment, as noted above); Hanatani (1975) uses cut-elimination for 
a system like HA". 


5.3 Metamathematical applications 

Here we give some examples only. One of the first applications is in 
Kreisel 1959. There a constructive interpretation of formulas of analysis 
is given by combining the translation D with specific models for T. A 
typical result is the following: if a formula A of analysis does not contain 
3 or V, then A <-> (A D )* holds classically. Here (A°)* is obtained from 
A D = 3 xWyAo by letting y range over the continuous functionals and x 
over the recursive continuous functionals (see e.g. Troelstra 1973, 2.6.5). 

Conservative extension results can be obtained from the axiomati- 
zation of A° <-> A (IV in 3.3 above); thus e.g. H + IP' + M' + AC is 
conservative over H for 3V-formulas if H = I-HA", WE-HA" or HAq . 
Other examples are given in Troelstra 1973 (3.5.14, 3.6.6, 3.6.18 (iii)). 

One of the best-known applications of D is to show closure under 
Markov’s rule ( Troelstra 1973, 3.8.3); recently, the more elegant and 
more widely applicable 0 method of Friedman (1978) and Dragalin 
(1980) has become available. 


n Diller’s assignment is not optimal, that is to say, he uses ordinals beyond to- 
°It remains to be seen whether the new method of proof yields better results 
in extracting effective bounds from classical proofs of If^-statements. In this con¬ 
nection see the discussion in Kreisel and Macintyre 1982. On the use of the no¬ 
counterexample interpretation and the Dialectica interpretation for the extraction of 
explicit bounds from classical proofs, see also Girard 1982 and Kreisel 1982. 
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Examples of applications in the proof theory of classical systems are 
Parsons’ use (1970) of the Dialectica interpretation for an analysis of 
subsystems of arithmetic and Feferman’s use (1971 and 1977, 8.6.2) of 
the interpretation in the study of subsystems of classical analysis by 
means of T relativized to non-constructive functionals. 


5.4 Related interpretations 

Towards the end of Section 3.2 we mentioned the variant interpreta¬ 
tion due to Diller and Nahm ( 1974 )• For this interpretation, one widens 
the notion of “quantifier-free” by permitting bounded numerical quan¬ 
tifiers Vx° < t in addition to A, V, —and then one associates with 
each A of HA“ a translation A A of the form 3xVyA A (x. y), where A a 
is quantifier-free in the wider sense. An interpretation result completely 
similar to (I) in 3.2 can then be proved (see TYoelstra 1973, 3.5.17). 

The Diller-Nahm interpretation was extended to analysis in Diller 
and Vogel 1975, and to systems with self-applicable operators in Beeson 
1978. 

Stein (1976, 1978, 1980) interpolated an infinite sequence of interpre¬ 
tations between A and modified realizability, a functional interpretation 
originally devised by Kreisel to show underivability of Markov’s princi¬ 
ple (Kreisel 1959; see Troelstra 1973, 3.4). Moreover, all these inter¬ 
pretations can be seen as special cases of a single interpretation M in a 
language with “set-types” (see also Rath 1978, Diller 1979). For theo¬ 
ries with decidable prime formulas, A is equivalent to D . 

Normalization and cut-elimination, D , A , modified realizability and 
realizability interpretations all give explicit realizations for numerical ex¬ 
istential statements proved in intuitionistic arithmetic. It can be shown 
that all these methods can be made to yield the same realizations (Mints 
1974, 1975, 1979; Stein 1976, 1980, 1981) —a fact which is by no means 
obvious, since the choice of terms for the Dialectica interpretation is not 
always canonical (see the discussion in Section 3.2 above). 

In conclusion, we mention a variant of the Dialectica interpretation of 
a different nature, due to Shoenfield (1967). This variant is directly ap¬ 
plicable to classical first-order arithmetic PA formalized using V, V, -1. 
To each A of the language of PA a formula A s of the form \/x3yA s (x, y) 
is assigned, where 3y now abbreviates with A s quantifier-free. A s 

is said to be valid if, for some term sequence t, A s (x, t) is derivable in (a 
version of) Godel’s T. Then A s is valid for each A provable in PA. This 
reduction bypasses the “negative translation” of PA into HA ( Godel 
1933e). See also Troelstra 1973, 3.5.18. 
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5.5 Concluding remarks 

As the preceding survey will have made clear, Godel’s paper led to 
many interesting results of a technical nature. But also, notwithstanding 
the fact that Godel did not quite achieve his own aims (see 4.4), the work 
connected with his paper has taught us several facts of philosophical 
interest. For example, the issue of the interpretation of A —> A A A made 
us aware of the role of decidable equality and of the contrast between 
intensionally and extensionally conceived functionals (see 3.2, 3.3 and 
5.4). 

The most important insight is perhaps that the use of logic can be 


Uber eine bisher noch nicht beniitzte Erweiterung 
des finiten Standpunktes 
( 1958 ) 

P. Bernays hat wiederholt darauf hingewiesen, 1 dass angesichts der Tat- 
sache der Unbeweisbarkeit der Widerspruchsfreiheit eines Systems mit 
geringeren Beweismitteln als denen des Systems selbst eine Uberschreitung 
des Rahmens der im Hilbertschen Sinn finiten Mathematik notig ist, um 
die Widerspruchsfreiheit der klassischen Mathematik, ja sogar um die der 
klassischen Zahlentheorie zu beweisen. Da die finite Mathematik als die 
der anschaulichen Evidenz definiert ist, 2 so bedeutet das (wie auch von 
Bernays (1935, p. 62 und 69) explizit formuliert wurde), dass man fur den 
Widerspruchsfreiheitsbeweis der Zahlentheorie gewisse abstrakte Begriffe 
braucht. Dabei sind unter abstrakten (oder nichtanschaulichen) Begriffen 
solche zu verstehen, die wesentlich von zweiter oder hoherer Stufe sind, 
das heisst, die nicht Eigenschaften oder Relationen konkreter Objekte (z. 
B. von Zeichenkombinationen) beinhalten, sondern sich auf Denkgebilde 
(z. B. Beweise, sinnvolle Aussagen usw.) beziehen, wobei in den Beweisen 
Einsichten fiber die letzteren gebraucht werden, die sich nicht aus den kom- 
binatorischen (raumzeitlichen) Eigenschaften der sie darstellenden Zeichen¬ 
kombinationen, sondern nur aus deren Sinn ergeben. 


1 Vgl. z. B.: Bernays 1941a, p. 144, 147; ferner: Hilbert und Bernays 1939, §5; und: 
Bernays 1954, P- 10. 

2 Vgl. die Hilbertsche Formulierung in Hilbert 1926 , p. 171-173. 





On a hitherto unutilized extension 


241 


replaced by the use of higher-type functionals, for a very limited set of 
types. The study of the functionals in T and its extensions has made it 
clear that formal proofs and functionals are in many ways similar. Thus, 
the work reported in 5.2 above has taught us, among other things, that 
normalization of functionals is, essentially, the same as normalization of 
proofs. 

A. S. Troelstra p 


p This commentary owes much to discussions with my friend and colleague Justus 
Diller. I gratefully acknowledge also the help and extensive comments from J. Daw¬ 
son, G. Kreisel and especially S. Feferman. 


On a hitherto unutilized extension 
of the finitary standpoint 
( 1958 ) 

P. Bernays has pointed out on several occasions 1 that, since the consis¬ 
tency of a system cannot be proved using means of proof weaker than those 
of the system itself, it is necessary to go beyond the framework of what is, in 
Hilbert’s sense, finitary mathematics if one wants to prove the consistency 
of classical mathematics, or even that of classical number theory. Conse¬ 
quently, since finitary mathematics is defined as the mathematics in which 
evidence rests on what is intuitive, 2 certain abstract notions are required 
for the proof of the consistency of number theory (as was also explicitly 
formulated by Bernays in his 1935, pages 62 and 69). Here, by abstract 
(or nonintuitive) notions we must understand those that are essentially of 
second or higher order, that is, notions that do not involve properties or re¬ 
lations of concrete objects (for example, of combinations of signs), but that 
relate to mental constructs (for example, proofs, meaningful statements, 
and so on); and in the proofs we make use of insights, into these mental 
constructs, that spring not from the combinatorial (spatiotemporal) prop¬ 
erties of the sign combinations representing the proofs, but only from their 
meaning. 


1 See, for example, Bernays 1941a, pp. 144 and 147; see also Hilbert and Bernays 
1939, §5, and Bernays 1954, P- 10. 

2 See Hilbert’s formulation in his 1926, pp. 171-173. 
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Obwohl in Ermanglung eines prazisen Begriffs der anschaulichen, bezie- 
hungsweise abstrakten, Evidenz ein strenger Beweis fur die Bernayssche 
Feststellung nicht vorliegt, so kann doch liber ihre Richtigkeit praktisch kein 
Zweifel bestehen, insbesondere seit dem Gentzenschen Beweis fiir die For- 

281 malisierbarkeit aller Rekur|sionen nach Ordinalzahlen < eo in der Zahlen- 
theorie. Denn die Giiltigkeit des Rekursionsschlusses fiir e 0 kann sicher 
nicht unmittelbar anschaulich gemacht werden, wie das zum Beispiel bei J 1 
moglich ist. Das heisst genauer, man kann die verschiedenen strukturellen 
Moglichkeiten, die fur absteigende Folgen bestehen, nicht mehr iibersehen 
und hat daher keine anschauliche Erkenntnis von der Notwendigkeit des 
Abbrechens jeder solchen Folge. Insbesondere kann durch schrittweises 
Ubergehen von kleineren zu grosseren Ordinalzahlen eine solche anschau¬ 
liche Erkenntnis nicht realisiert werden, sondern bloss eine abstrakte 
Erkenntnis mit Hilfe von Begriffen hoherer Stufe. Das letztere wird durch 
den abstrakten Begriff der “Erreichbarkeit” 3 geleistet, welcher durch die 
inhaltliche Beweisbarkeit der Giiltigkeit einer gewissen Schlussweise defi- 
niert ist. Auch ist es im Rahmen der fiir uns anschaulichen Mathematik 
nicht moglich, den Induktionsschluss nach einer hinreichend grossen Ordi- 
nalzahl auf eine Kette anderer Einsichten zuriickzufiihren. Vielmehr fiihrt 
jeder Versuch, das zu tun, zu Induktionen von im wesentlichen derselben 
Ordnung. Ob die Notwendigkeit abstrakter Begriffe bloss durch die prak- 
tische Unmoglichkeit, kombinatorisch allzu komplizierte Verhaltnisse an- 

282 schaulich vorzustellen, 4 bedingt ist oder prinzipielle Griinde hat, | lasst sich 
nicht ohne weiteres entscheiden. Im zweiten Fall miisste nach Prazisierung 
der fraglichen Begriffe ein strenger Beweis fiir das Bestehen jener Not¬ 
wendigkeit moglich sein. 


3 W. Ackermann erklart zwar in 1951, p. 407, dass “erreichbar” einen anschaulichen 
Sinn habe, wenn Beweisbarkeit als formale Beweisbarkeit nach gewissen Eegeln ver- 
standen wird. Aber darauf ist zu erwidern, dass aus dieser anschaulichen Tatsache die 
Giiltigkeit des Schlusses durch transfinite Induktion fiir eine vorgelegte Eigenschaft, nur 
mit Hilfe abstrakter Begriffe (oder mit Hilfe transfiniter Induktion in der Metamathe- 
matik) folgt. Allerdings ist der Begriff “erreichbar”, zumindest fiir Induktionen bis to, 
durch schwachere abstrakte Begriffe ersetzbar (vgl. Hilbert und Bernays 1989). 

4 Man beachte, dass eine adaquate beweistheoretische Charakterisierung einer durch 
Absehen von dieser Schranke idealisierten anschaulichen Evidenz Schlussweisen enthal- 
ten wird, die fiir uns nicht anschaulich sind und die sehr wohl eine Reduktion des in- 
duktiven Schlusses auf den einer wesentlich kleineren Ordnung gestatten konnten. Eine 
andere Moglichkeit, den urspriinglichen finiten Standpunkt zu erweitern, fiir die dasselbe 
gilt, besteht darin, dass man abstrakte Begriffe, die auf nichts anderes als auf finite Be¬ 
griffe und Gegenstande, und zwar in kombinatorisch finiter Weise, Bezug nehmen, mit 
zur finiten Mathematik rechnet und diesen Prozess iteriert. Solche Begriffe sind zum 
Beispiel diejenigen, welche in der Reflexion auf den Inhalt schon konstruierter finiter For- 
282 malismen involviert sind. Ein | dieser Idee entsprechender Formalismus wurde von G. 
Kreisel aufgestellt. Vgl. seinen Vortrag auf dem Internationalen Mathematikerkongress 
in Edinburgh, 1958 iKreisel 1960J. Man beachte, dass bei dieser Art der Erweiterung 
des Finitismus das abstrakte Element in einer wesentlichen schwacheren Form auftritt 
als bei der weiter unten besprochenen oder in der intuitionistischen Logik. 
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In the absence of a precise notion of what it means to be evident, ei¬ 
ther in the intuitive or in the abstract realm, we have no strict proof of 
Bernays’ assertion; practically speaking, however, there can be no doubt 
that it is correct, in particular after Gentzen proved that all recursions on 
ordinals less than eo can be formalized in number theory. For, the validity 
of inference by recursion up to eo surely cannot be made immediately in¬ 
tuitive, as it can up to, say, u> 2 . More precisely, we can no longer survey 
the various structural possibilities that obtain for descending sequences, 
and therefore we cannot intuitively recognize that every such sequence 
will necessarily terminate. In particular, we cannot acquire such knowl¬ 
edge intuitively by passing stepwise from smaller to larger ordinals; we 
can only gain knowledge abstractly by means of notions of higher type. 
This is achieved by means of the abstract notion of ‘accessibility’, 3 which 
is defined by our being able to give an informally understood proof that a 
certain kind of inference is valid. Moreover, within the framework of that 
part of mathematics which is intuitive to us, inference by induction up to 
a sufficiently large ordinal cannot be reduced to a chain of other insights. 
Rather, every attempt to do so leads to inductions of essentially the same 
order. It cannot be determined out of hand whether the need for abstract 
notions is due merely to the practical impossibility of our intuitively imag¬ 
ining states of affairs that are all too complex from the combinatorial point 
of view 4 or whether there are theoretical reasons for it. In the latter case 
it would have to be possible, once the notions in question have been made 
precise, to give a strict proof that this need exists. 


3 To be sure, W. Ackermann tells us in his 1951, p. 407, that ‘accessible’ will be 
intuitively meaningful if provability is understood as formal provability according to 
certain rules. But to this one must reply that, from this intuitive fact, the validity 
of inference by transfinite induction for a given property can be demonstrated only by 
means of abstract notions (or by means of transfinite induction in metamathematics). 
The notion ‘accessible’ can, however, be replaced, at least for inductions up to eo, by 
weaker abstract notions (see Hilbert and Bernays 1939). 

4 Note that, if we were to give an adequate proof-theoretic characterization of idealized 
intuitive evidence while ignoring this limitation, we would use kinds of inference that, for 
us, are not intuitive and that might very well allow us to reduce the inductive inference 
to one of a substantially lower order. The same holds of another possible extension of 
the original finitary standpoint; it consists in adjoining to finitary mathematics abstract 
notions that relate, in a combinatorially finitary way, only to finitary notions and objects, 
and then iterating this procedure. Among such notions are, for example, those that are 
involved when we reflect on the content of finitary formalisms that have already been 
constructed. A formalism embodying this idea was set up by G. Kreisel. See his 1960. 
Note that, when finitism is extended in this way, the abstract element appears in an 
essentially weaker form than in the extension discussed below or in intuitionistic logic. 
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Jedenfalls lehrt die Bernayssche Bemerkung, zwei Bestandteile in der 
finiten Einstellung unterscheiden, namlich erstens das konstruktive Ele¬ 
ment, welches darin besteht, dass von mathematischen Objekten nur in- 
soweit die Rede sein darf, als man sie aufweisen oder durch Konstruk- 
tion tatsachlich herstellen kann; zweitens das spezifisch finitistische Ele¬ 
ment, welches dariiber hinaus fordert, dass die Objekte, iiber welche man 
Aussagen macht, mit welchen die Konstruktionen ausgefiihrt werden und 
welche man durch sie erhalt, “anschaulich” sind, das heisst letzten Endes 
raum-zeitliche Anordnungen von Elementen, deren Beschaffenheit abgese- 
hen von Gleichheit und Verschiedenheit irrelevant ist. (Im Gegensatz dazu 
sind jene Objekte in der intuitionistischen Logik sinnvolle Aussagen und 
Beweise.) 

Es ist die zweite Forderung, welche fallen gelassen werden muss. Dieser 
Tatsache wurde bisher dadurch Rechnung getragen, dass man Teile der 
intuitionistischen Logik und Ordinalzahltheorie zur finiten Mathematik ad- 
jungierte. Im folgenden wird gezeigt, dass man statt dessen fur den Wider- 
spruchsfreiheitsbeweis der Zahlentheorie auch den Begriff der berechen- 
baren Funktion endlichen Typs iiber den naturlichen Zahlen und gewisse 
sehr elementare Konstruktionsprinzipien fur solche Funktionen verwen- 
den kann. Dabei wird der Begriff “berechenbare Funktion vom Typus 
t" folgendermassen erklart: 1. Die berechenbaren Funktionen vom Ty¬ 
pus 0 sind die naturlichen Zahlen. 2. Wenn die Begriffe “berechenbare 
Funktion vom Typus to ”, “berechenbare Funktion vom Typus ti’\ ..., 
“berechenbare Funktion vom Typus tk" (wobei k > 1) bereits definiert sind, 
so wird eine berechenbare Funktion vom Typus (to, ti,. ..tk) definiert als 
eine immer ausfiihrbare (und als solche konstruktiv erkennbare) Operation, 
283 welche jedem fc-tupel | berechenbarer Funktionen der Typen 

eine berechenbare Funktion vom Typus to zuordnet. Dieser Begriff 5 ist 
als unmittelbar verstandlich 6 zu betrachten, vorausgesetzt dass man die 


5 Man kann dariiber im Zweifel sein, ob wir eine geniigend deutliche Vorstellung vom 
Inhalt dieses Begriffs haben, aber nicht dariiber, ob die weiter unten angegebenen Ax- 
iome fur ihn gelten. Derselbe scheinbar paradoxe Sachverhalt besteht auch fur den der 
intuitionistischen Logik zugrunde liegenden Begriff des inhaltlich richtigen Beweises. Wie 
die nachfolgenden Uberlegungen und die intuitionistisch interpretierte Theorie der rekur- 
siven Funktionen und Funktionale zeigen, sind diese beiden Begriffe innerhalb gewisser 
Grenzen als Grundbegriffe durcheinander ersetzbar. Dabei ist zu beachten, dass, wenn 
der Begriff der berechenbaren Funktion nicht implizit den Begriff des Beweises enthal- 
ten soil, die Ausfiihrbarkeit der Operationen unmittelbar aus der Kette der Definitionen 
ersichtlich sein muss, wie das fiir alle Funktionen des weiter unten angegebenen Systems 
T der Fall ist. 

6 A. M. Turing hat bekanntlich mit Hilfe des Begriffs einer Rechenmaschine eine 
Definition des Begriffs einer berechenbaren Funktion erster Stufe gegeben. Aber wenn 
dieser Begriff nicht schon vorher verstandlich gewesen ware, hatte die Frage, ob die 
Turingsche Definition adaquat ist, keinen Sinn. 
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In any case Bernays’ remark teaches us to distinguish two components 
in the finitary attitude; namely, first, the constructive element, which con¬ 
sists in our being allowed to speak of mathematical objects only in so far 
as we can exhibit them or actually produce them by means of a construc¬ 
tion; second, the specifically finitistic element, which makes the further 
demand that the objects about which we make statements, with which the 
constructions are carried out and which we obtain by means of these con¬ 
structions, are ‘intuitive’, that is, are in the last analysis spatiotemporal 
arrangements of elements whose characteristics other than their identity or 
nonidentity are irrelevant. (By contrast, in intuitionistic logic these objects 
are meaningful statements and proofs.) 

It is the second requirement that must be dropped. This fact has hitherto 
been taken into account by our adjoining to finitary mathematics parts of 
intuitionistic logic and the theory of ordinals. In what follows we shall 
show that, for the consistency proof of number theory, we can use, instead, 
the notion of computable function of finite type on the natural numbers 
and certain rather elementary principles of construction for such functions. 
Here the notion ‘computable function of type t’ is defined as follows: 

(1) the computable functions of type 0 are the natural numbers; 

(2) if the notions ‘computable function of type 4’, ‘computable function 
of type ti, ‘computable function of type 4’ (with k > 1) have already 
been defined, then a computable function of type (4,4, • • • ,4) is defined 
as an operation, always performable (and constructively recognizable as 
such), that to every fc-tuple of computable functions of types 4, ■••,4 
assigns a computable function of type 4- This notion 5 is to be regarded 
as immediately intelligible, 6 provided the notions ‘computable function of 
type 4’ (* = 0,1,..., k) are already understood. If we then regard the type 
t as a variable, we arrive at the notion, required for the consistency proof, 
of a computable function of finite type t. 


5 One may doubt whether we have a sufficiently clear idea of the content of this notion, 
but not that the axioms given below hold for it. The same apparently paradoxical 
situation also obtains for the notion, basic to intuitionistic logic, of a proof that is 
informally understood to be correct. As the considerations presented below and the 
intuitionistically interpreted theory of recursive functions and functionals show, these 
two notions are, within certain limits, interchangeable as primitive notions. If the notion 
of computable function is not to implicitly contain the notion of proof, we must see to 
it that it is immediately apparent from the chain of definitions that the operations can 
be performed, as is the case for all functions in the system T specified below. 

6 As is well known, A. M. Turing, using the notion of a computing machine, gave a 
definition of the notion of computable function of the first order. But, had this notion 
not already been intelligible, the question whether Turing’s definition is adequate would 
be meaningless. 
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Begriffe “berechenbare Funktion vom Typus t*” (* = 0,1,... k) bereits ver- 
standen hat. Indem man dann den Typus t als Variable betrachtet, gelangt 
man zu dem fur den Widerspruchsffeiheitsbeweis benotigten Begriff einer 
berechenbaren Funktion endlichen Typs t. 

Als evidente Axiome sind, neben den Axiomen der Identitat (auch fur 
Funktionen 7 ), dem 3. und 4. Peanoschen Axiom und der Substitutions- 
regel fiir freie Variable, keine anderen notig als erstens solche, die es 
gestatten, Funktionen durch Gleichsetzung mit einem aus Variablen und 
vorher definierten Konstanten aufgebauten Term und durch einfache In- 
duktion nach einer Zahlvariablen zu definieren, und zweitens den Schluss 
der vollstandigen Induktion nach einer Zahlvariablen anzuwenden. Das 
heisst die Axiome dieses Systems (es werde T genannt) sind formal fast 
284 dieselben 8 wie die der primitiv rekursiven Zahlentheorie, nur dass | die Va¬ 
riablen (ausser denen, auf die Induktion angewendet wird), sowie auch die 
definierten Konstanten, einen beliebigen endlichen Typus iiber den natiir- 
lichen Zahlen haben konnen. Der Einfachheit halber wird im folgenden 
der zweiwertige Aussagenkalkiil, angewendet auf Gleichungen, hinzugenom- 
men, obwohl die Wahrheitsfunktionen durch zahlentheoretische Funktionen 
ersetzbar sind. Gebundene Variable werden nicht zugelassen. Das Sys¬ 
tem T ist von gleicher Beweisstarke wie ein System der rekursiven Zahlen¬ 
theorie, in dem vollstandige Induktion fur alle Ordinalzahlen < eo (in der 
gewohnlichen Darstellung) zugelassen wird. 

Die Zuruckfiihrung der Widerspruchsfreiheit der klassischen Zahlentheo¬ 
rie auf die des Systems T gelingt mit Hilfe der folgenden Interpretation der 
Heytingschen Zahlentheorie, auf welche ja die klassische zuriickfiihrbar ist: 9 

Es wird jeder Formel F der intuitionistischen Zahlentheorie 10 (deren 
freie Variable in ihrer Gesamtheit mit x bezeichnet werden) eine Formel F' 
der Gestalt (3y)(z)A(y, z, x ) zugeordnet, wobei y und z endliche Reihen von 
Variablen irgendwelcher Typen sind, und A(y, z, x) ein quantorenfreier 
Ausdruck mit keinen andern als den in x,y,z vorkommenden Variablen. 
Die Variablen der Reihen x, y, z, deren Gliederzahl auch 0 sein kann, sind 
samtlich untereinander verschieden. Mit xy wird die aus x und y in dieser 
Reihenfolge zusammengesetzte Reihe bezeichnet. 


7 Identitat zwischen Funktionen ist als intensionale oder Definitionsgleichheit zu ver- 
stehen. 

8 Bei der Definition durch Gleichsetzung mit einem Term tritt insofern ein Unterschied 
auf, als man eine Funktion P hoheren Typs auch durch \P(zi , £2, • ■ ■ x n )](t/i, j/2, - - ■ 
Dm) = E definieren kann. Aber dieser Unterschied fallt weg, falls mehrstellige Funktio¬ 
nen in der von A. Church angegebenen Weise durch einstellige ersetzt werden. 

9 Vgl. Godel 1933e. 

10 Die Zahlentheorie soil so formalisiert sein, dass keine Aussagen- oder Funktionsva- 
riable vorkommen. Die Axiome des Aussagenkalkuls sind als Schemata fiir alle moglichen 
Einsetzungen zu betrachten. 
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Besides the axioms of identity (including those for functions 7 ), Peano’s 
third and fourth axioms, and the rule of substitution for free variables, 
we need no other axioms [for the notion of computable function] than the 
following equally evident ones: (1) axioms that allow us to define functions 
by setting them equal to a term constructed from variables and previously 
defined constants, as well as by simple induction on a number variable; 
(2) axioms that allow us to use inference by mathematical induction on 
a number variable. That is, the axioms of this system (let it be T) are 
formally almost the same 8 as those of primitive recursive number theory, 
the only exception being that the variables (other than those on which 
induction is carried out), as well as the defined constants, can be of any 
finite type over the natural numbers. For the sake of simplicity, we shall, 
in what follows, avail ourselves of the two-valued propositional calculus, 
applied to equations, even though truth functions could be replaced by 
number-theoretic ones. Bound variables are not admitted. The system T 
is of the same proof-theoretic strength as a system of recursive number 
theory in which induction is permitted up to any ordinal less than eo (in 
the usual representation). 

The consistency of classical number theory can be reduced to that of 
the system T by means of the following interpretation of Heyting’s number 
theory, to which, of course, classical number theory is reducible: 9 

To each formula F of intuitionistic number theory 10 (x standing for all 
of its free variables) we assign a formula F' of the form (3y)(z)A(y,z,x), 
where y and z are finite sequences of variables of any type and A(y, z, x) 
is a quantifier-free expression containing no other variables than those oc¬ 
curring in x, y and z. The variables of the (possibly empty) sequences x, 
y and z are understood to be pairwise distinct. Let xy be the sequence 
obtained when x is immediately followed by y. 


'Identity between functions is to be understood as intensional or definitional equality. 
“When we define a function by setting it equal to a term, a difference does occur, 
since we can also define a function P of higher type by the stipulation 
[P(au, x 2 , ■ • ■, In)](si, 2/2, ■ ■ ■, S/m) = E. 

But this difference vanishes if we replace many-place functions by one-place functions 
in the way specified by A. Church. 

9 See Godel 1933e. 

10 Number theory is assumed to be formalized so that no propositional or functional 
variables occur. The axioms of the propositional calculus are to be regarded as schemas 
in which all possible substitutions are permitted. 
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Ferner werden folgende Bezeichnungen verwendet: 

1. v, w sind endliche Reihen von Variablen irgendwelcher Typen; s, t 
sind Zahlvariable; u ist eine Reihe von Zahlvariablen. 

2. ist eine Reihe von Variablen, deren Anzahl und Typen dadurch 
bestimmt sind, dass jede von ihnen auf y als Argumentreihe angewendet 
werden kann und dass die Reihe der so erhaltenen Werte (welche mit V(y) 
bezeichnet wird) hinsichtlich der Anzahl und der Typen ihrer Glieder mit 
der Reihe v iibereinstimmt. 

| 3. Analog wird die Variablenreihe Y (bzw. Z, bzw. Z) hinsichtlich der 

Anzahl und der Typen ihrer Glieder durch die Argumentreihe s (bzw. yw. 
bzw. y) und durch die mit der Reihe der Werte gleichtypige Reihe y (bzw. 
z, bzw. z) bestimmt. 

Funktionen mit 0 Leerstellen und Werten vom Typus r werden mit Ob- 
jekten vom Typus r identifiziert, eingliedrige Variablenreihen mit Varia¬ 
blen. 

Die Zuordnung von F' zu F geschieht durch Induktion nach der An¬ 
zahl k der in F enthaltenen logischen Operatoren. (Die bei der Wahl der 
Symbole fur die gebundenen Variablen zu beachtenden Bedingungen und 
die heuristische Begriindung der Definitionen werden nach den Formeln 
gegeben.) 

I. Fur k = 0 sei F' = F. 

II. Es sei 

F' = ( 3y){z)A(y,z,x) 

und 

G' = {3v){w)B{v,w,u) 


bereits definiert; dann ist per definitionem: 

1. (F A G)' = (3yv){zw)[A(y, z, x) A B{v, w ,«)]. 

2. (F V G)' = (3 yvt){zw)\t — 0 A A{y, z, x). V. t = 1 A B(v, w, «)]. 

3. [( S )F]' = (3y)( S z)A(y( s ),z^). 

4. [(3s)F]'= (3sy)(z)A(y,z,x). 

5. ( F D G)' = (3 VZ)(yw)[A(y,Z(yw),x) D B(V(y),w,u)]. 

6. ^FY = (3Z)(yhA(y,Z(y),x). 

s ist eine beliebige Zahlvariable. Vor Anwendung der Regeln 1-5 sind 
notigenfalls die gebundenen Variablen der Formeln F' und G' so umzube- 
nennen, dass sie samtlich untereinander und von den Variablen der Reihen 
x, u sowie auch von s verschieden sind. Ferner sind die durch Anwendung 
der Regeln 2, 3, 5, 6 neu eingefiihrten gebundenen Variablen der Reihen 
t, y, V, Z. Z so zu wahlen, dass sie untereinander und von den in den 
betreffenden Formeln schon vorkommenden Variablen verschieden sind. 

Man beachte, dass 6. aus 5. folgt, falls durch pD. 0 = 1 definiert 
wird. Zu 5. gelangt man, indem man (fiir die auftretenden Spezialfalle) 
die Aussage (3 x)H{x) D (3 y)R(y) (bzw. (y)R(y) D | (x)H(x)) mit der 
Existenz von fiir alle Argumentreihen vom Typus der Variablenreihe x 
definierten berechenbaren Funktionen identifiziert, welche jedem Beispiel 
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We also use the following notation: 

(1) v and w are finite sequences of variables of any type; s and t are 
number variables; u is a sequence of number variables; 

(2) V is a sequence of variables whose number and types are determined 
thus: each of these variables can take y as an argument sequence, and the 
sequence of values thus obtained (let it be V(y)) agrees with the sequence 
v in the number and types of its terms; 

(3) similarly, the sequence Y (or Z, or Z) is determined, as far as the 
number and types of its terms are concerned, by the argument sequence 
s (or yw, or y, respectively), as well as by the sequence y (or z, or z. 
respectively) whose types are those of the sequence of values. 

Zero-place functions whose values are of type r are identified with objects 
of type t, and one-term sequences of variables are identified with variables. 

The assignment of F' to F proceeds by induction on the number k of 
logical operators contained in F. (The conditions to be observed in choos¬ 
ing symbols for the bound variables, as well as the heuristic justification of 
the definitions, will be given after the formulas.) 

I. For k = 0, let F' = F. 

II. Let 

F' = (3y)(z)A(y,z,x) 

and 

G' = (3 v)(w)B(v,w,v) 

be already defined; then we have by definition 

1. (F A G)' = (3 yv)(zw)[A(y, z, x) A B(v, w ,«)]. 

2. (F V G)' = (3 yvt)(zw)[t — 0 A A(y, z,x) .V. t = 1 A B{v,w,u)\. 

3. [( S )F]' = (3F)MA(y(i), Z ,i). 

4. [(3s)F]' = {3sy)(z)A{y,z,x). 

5. (F D G)' = (3VZ)(yw)[A(y,Z(yw),x) D B(V(y),w,u)]. 

6. (^FY = {3Z)(y)^A(y,Z(y),x). 

Here s is any number variable. Before applying Rules 1-5, we rename, 
if necessary, the bound variables of formulas F' and G' so. that they will 
all be distinct from one another and from the variables of the sequences x 
and u, as well as from s. Further, the bound variables of the sequences t, 
Y, V, Z and Z that are newly introduced when Rules 2, 3, 5 and 6 are 
applied must be chosen distinct from one another and from the variables 
that already occur in the formulas considered. 

Note that 6 follows from 5, in case -<p is defined as p D. 0 = 1. We arrive 
at 5 as follows: we identify (for the special cases at hand) the proposition 
(3 x)H{x) D {3y)R(y) (or (y)R(y) D (x)H(x )) with the existence of com¬ 
putable functions (defined for all argument sequences of the same type as 
the variable sequence x) that to each sequence making the antecedent true 
assign a sequence making the consequent true (or to each sequence making 
the consequent false assign a sequence making the antecedent false). 

Obviously, we do not claim that Definitions 1-6 reproduce the meaning 
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fur das Implicans (bzw. Gegenbeispiel fur das Implicatum) ein Beispiel fur 
das Implicatum (bzw. Gegenbeispiel fur das Implicans) zuordnen. 

Selbstverstandlich wird nicht behauptet, dass die Definitionen 1-6 den 
Sinn der von Brouwer und Heyting eingefiihrten logischen Partikel wieder- 
geben. Wieweit sie diese ersetzen konnen, bedarf einer naheren Unter- 
suchung. Man zeigt leicht, dass, wenn F im Heytingschen System Z der 
Zahlentheorie beweisbar ist, Funktionen Q in T definiert werden konnen, 
fur welche A(Q(x),z,x) in T beweisbar ist. Es ist namlich leicht nachprfif- 
bar, dass diese Behauptung fur die Axiome von Z gilt und ihre Richtigkeit 
sich bei Anwendung der Schlussregeln von Z von den Pramissen auf die 
Konklusion fibertragt. 

Die Verifikation wird besonders einfach, wenn man folgendes Axiomen- 
system der intuitionistischen Logik zugrunde legt: 11 

Axiome: Taut, Add, Perm, die zu diesen dualen Axiome fur A, 
0 = 1 .Dp (~<p wird durch p D. 0 = 1 definiert). 

Schlussregeln: Modus ponens, Einsetzungsregel fur freie Zahlvariable, 
Syll (mit zwei Pramissen), Sum, Exp, Imp, die Regeln fiber das Hinzuffigen 
und Weglassen eines All- (bzw. Existenz-)Zeichens im Implicatum (bzw. 
Implicans) einer bewiesenen Implikation. 

Ffir den Widerspruchsfreiheitsbeweis der klassischen Zahlentheorie 
konnen die V und die 3 enthaltenden Axiome und Schlussregeln wegge- 
lassen werden. Bei alien auf Sum folgenden Regeln stellt sich heraus, dass 
die in T zu beweisende Aussage im wesentlichen dieselbe ist wie die auf 
Grund der Pramisse bereits bewiesene. 

Es ist klar, dass man, von demselben Grundgedanken ausgehend, auch 
viel starkere Systeme als T konstruieren kann, zum Beispiel durch Zulas- 
sung transfiniter Typen oder der von Brouwer ffir den Beweis des “Fan- 
Theorems” 12 benutzten Schlussweisc. 

287 | Zusammenfassung 

P. Bernays hat darauf hingewiesen, dass man, um die Widerspruchsfreiheit der klas¬ 
sischen Zahlentheorie zu beweisen, den Hilbertschen finiten Standpunkt dadurch erwei- 
tern muss, dass man neben den auf Symbole sich beziehenden kombinatorischen Begriffen 
gewisse abstrakte Begriffe zulasst. Die abstrakten Begriffe, die bisher fur diesen Zweck 
verwendet wurden, sind die der konstruktiven Ordinalzahltheorie und die der intuitioni¬ 
stischen Logik. Es wird gezeigt, dass man statt dessen den Begriff einer berechenbaren 
Funktion endlichen einfachen Typs iiber den natiirlichen Zahlen benutzen kann, wobei 
keine anderen Konstruktionsverfahren fur solche Funktionen notig sind, als einfache 
Rekursion nach einer Zahlvariablen und Einsetzung von Funktionen ineinander (mit 
trivialen Funktionen als Ausgangspunkt). 


11 Beziiglich der Bezeichnungen, vgl. Whitehead und Russell 1925, p. xii. Dieselben 
Bezeichnungen werden auch fiir die den Formeln entsprechenden Schlussregeln ver¬ 
wendet. 

12 Vgl. Heyting 1956, p. 42. 
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of the logical particles introduced by Brouwer and Heyting. Further inves¬ 
tigation is needed to see how far these can be replaced by our definitions. 
One can easily show that, if F is provable in Heyting’s system Z of num¬ 
ber theory, then in T functions Q can be defined for which A(Q(x), z,x) 
is provable in T. For one can easily verify that this assertion holds for the 
axioms of Z and that, when we apply any inference rule of Z, it holds for 
the conclusion whenever it holds for the premises. 

The verification is particularly simple if we adopt the following axiom 
system for intuitionistic logic: 11 

Axioms: Taut, Add, Perm, the axioms dual to these for A, and 
0 = 1. D p (~'p is defined as p D . 0 = 1). 

Rules of inference: Modus ponens, the rule of substitution for free num¬ 
ber variables, Syll (with two premises), Sum, Exp, Imp, and the rules for 
inserting or deleting a universal quantifier in the consequent (or an exis¬ 
tential quantifier in the antecedent) of a proved conditional. 

For the consistency proof of classical number theory we can omit the 
axioms and rules of inference containing V or 3. For all the rules following 
Sum it turns out that the proposition to be proved in T is essentially the 
same as the one that has already been proved on the basis of the premise. 

It is clear that, starting from the same basic idea, one can also construct 
systems that are much stronger than T, for example by admitting trans- 
finite types or the sort of inference that Brouwer used in proving the ‘fan 
theorem’. 12 


Abstract 

P. Bernays has pointed out that, in order to prove the consistency of classical num¬ 
ber theory, it is necessary to extend Hilbert’s finitary standpoint by admitting certain 
abstract concepts in addition to the combinatorial concepts referring to symbols. The 
abstract concepts that so far have been used for this purpose are those of the construc¬ 
tive theory of ordinals and those of intuitionistic logic. It is shown that the concept of 
a computable function of finite simple type over the integers can be used instead, where 
no other procedures of constructing such functions are necessary except simple recursion 
by an integer variable and substitution of functions in each other (starting with trivial 
functions). 


11 For the notation see Whitehead and Russell 1925, p. xii. We use the same notation 
for the rules of inference that correspond to the formulas. 

12 See Heyting 1956, p. 42. 
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Postscript to Spector 1962 
{1962) 


[Spector 1962 was published posthumously and edited by G. Kreisel.J 

This important paper [Spector 1962\ was written by Clifford Spector 
during his stay at the Institute for Advanced Study in 1960-1961 under 
a grant from the Office of Naval Research. The discussions P. Bernays 
and I had with Spector (see footnote l) 1 took place after the main result 
(contained in §10 of the paper) had been established already. However, 
it ought to be mentioned that during the time Spector first established 
this result he was in close contact with Kreisel. It was Spector’s express 
intention to give to Kreisel a good deal of credit for his work. Originally 
a joint publication by Spector and Kreisel was envisaged. This plan was 
dropped because Spector had taken over the elaboration by himself and 
because the version of the proof which was to be published was due to 
Spector. Also Spector alone, at that time, was working on an extension 
of the result in the direction of stricter constructivity which he hoped to 
include in his paper. 


1 [Footnote 1 of Spector 1962 was written by Kreisel, as were all the footnotes in 
that paper, and stated in part: “From paragraph 3 of the introduction below, and from 
conversations with Spector, I know that he valued highly his discussions with P. Bernays 
and K. Godel on the subject of the present paper.”J 
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What is Cantor’s continuum problem? 
(1964) 


[This article is a revised and expanded version of Godel 1947. The 
introductory note to both 1947 and 1964 is found on page 154, immediately 
preceding 194 ?-J 


1. The concept of cardinal number 

Cantor’s continuum problem is simply the question: How many points 
are there on a straight line in Euclidean space? An equivalent question is: 
How many different sets of integers do there exist? 

This question, of course, could arise only after the concept of “number” 
had been extended to infinite sets; hence it might be doubted if this exten¬ 
sion can be effected in a uniquely determined manner and if, therefore, the 
statement of the problem in the simple terms used above is justified. Closer 
examination, however, shows that Cantor’s definition of infinite numbers 
really has this character of uniqueness. For whatever “number” as applied 
to infinite sets may mean, we certainly want it to have the property that 
the number of objects belonging to some class does not change if, leaving 
the objects the same, one changes in any way whatsoever their properties 
or mutual relations (e.g., their colors or their distribution in space). From 
this, however, it follows at once that two sets (at least two sets of change¬ 
able objects of the space-time world) will have the same cardinal number 
if their elements can be brought into a one-to-one correspondence, which 
is Cantor’s definition of equality between numbers. For if there exists such 
a correspondence for two sets A and B it is possible (at least theoretically) 
to change the properties and relations of each element of A into those 
of the corresponding element of B, whereby A is transformed into a set 
completely indistinguishable from B, hence of the same cardinal number. 
For example, assuming a square and a line segment both completely filled 
with mass points (so that at each point of them exactly one mass point 
259 is situated), it follows, owing to the | demonstrable fact that there exists 
a one-to-one correspondence between the points of a square and of a line 
segment and, therefore, also between the corresponding mass points, that 
the mass points of the square can be so rearranged as exactly to fill out the 
line segment, and vice versa. Such considerations, it is true, apply directly 
only to physical objects, but a definition of the concept of “number” which 
would depend on the kind of objects that are numbered could hardly be 
considered to be satisfactory. 
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So there is hardly any choice left but to accept Cantor’s definition of 
equality between numbers, which can easily be extended to a definition of 
“greater” and “less” for infinite numbers by stipulating that the cardinal 
number M of a set A is to be called less than the cardinal number N 
of a set B if M is different from N but equal to the cardinal number 
of some subset of B. That a cardinal number having a certain property 
exists is defined to mean that a set of such a cardinal number exists. On 
the basis of these definitions, it becomes possible to prove that there exist 
infinitely many different infinite cardinal numbers or “powers”, and that, 
in particular, the number of subsets of a set is always greater than the 
number of its elements; furthermore, it becomes possible to extend (again 
without any arbitrariness) the arithmetical operations to infinite numbers 
(including sums and products with any infinite number of terms or factors) 
and to prove practically all ordinary rules of computation. 

But, even after that, the problem of identifying the cardinal number of 
an individual set, such as the linear continuum, would not be well-defined 
if there did not exist some systematic representation of the infinite cardi¬ 
nal numbers, comparable to the decimal notation of the integers. Such a 
systematic representation, however, does exist, owing to the theorem that 
for each cardinal number and each set of cardinal numbers 1 there exists 
exactly one cardinal number immediately succeeding in magnitude and 
that the cardinal number of every set occurs in the series thus obtained. 2 
This theorem makes it possible to denote the cardinal number immediately 
succeeding the set of finite numbers by Ho (which is the power of the “de- 
numerably infinite” sets), the next one by Hi, etc.; the one immediately 
succeeding all H, (where i is an integer) by H^, the next one by H^+i, etc. 
The theory of ordinal numbers provides the means for extending this series 
further and further. 


1 As to the question of why there does not exist a set of all cardinal numbers, see 
footnote 15. 

2 The axiom of choice is needed for the proof of this theorem (see Fraenkel and 
Bar-Hillel 1958). But it may be said that this axiom, from almost every possible point 
of view, is as well-founded today as the other axioms of set theory. It has been proved 
consistent with the other axioms of set theory which are usually assumed, provided that 
these other axioms are consistent (see my 1940). Moreover, it is possible to define in 
terms of any system of objects satisfying the other axioms a system of objects satisfying 
those axioms and the axiom of choice. Finally, the axiom of choice is just as evident as 
the other set-theoretical axioms for the “pure” concept of set explained in footnote 14. 
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260 | 2. The continuum problem, the continuum hypothesis, 

and the partial results concerning its truth 
obtained so far 

So the analysis of the phrase “how many” unambiguously leads to a 
definite meaning for the question stated in the second line of this paper: 
The problem is to find out which one of the M’s is the number of points of a 
straight line or (which is the same) of any other continuum (of any number 
of dimensions) in a Euclidean space. Cantor, after having proved that 
this number is greater than No, conjectured that it is Ni. An equivalent 
proposition is this: Any infinite subset of the continuum has the power 
either of the set of integers or of the whole continuum. This is Cantor’s 
continuum hypothesis. 

But, although Cantor’s set theory now has had a development of more 
than seventy years and the problem evidently is of great importance for 
it, nothing has been proved so far about the question what the power of 
the continuum is or whether its subsets satisfy the condition just stated, 
except (1) that the power of the continuum is not a cardinal number of 
a certain special kind, namely, not a limit of denumerably many smaller 
cardinal numbers, 3 and (2) that the proposition just mentioned about the 
subsets of the continuum is true for a certain infinitesimal fraction of these 
subsets, the analytic 4 sets. 5 Not even an upper bound, however large, 
can be assigned for the power of the continuum. Nor is the quality of 
the cardinal number of the continuum known any better than its quantity. 
It is undecided whether this number is regular or singular, accessible or 
inaccessible, and (except for Konig’s negative result) what its character of 
cofinality (see footnote 4) is. The only thing that is known, in addition to 
the results just mentioned, is a great number of consequences of, and some 
propositions equivalent to, Cantor’s conjecture. 6 

This pronounced failure becomes still more striking if the problem is 
considered in its connection with general questions of cardinal arithmetic. 
It is easily proved that the power of the continuum is equal to 2 N °. So 
the continuum problem turns out to be a question from the “multiplica¬ 
tion table” of cardinal numbers, namely, the problem of evaluating a certain 


3 See Hausdorff 1914, P- 68, or Bachmann 1955, p. 167. The discoverer of this 
theorem, J. Konig, asserted more than he had actually proved (see his 1905.) 

4 See the list of definitions on pp. 268-9. 

°See Hausdorff 1935, p. 32. Even for complements of analytic sets the question is 
undecided at present, and it can be proved only that they either have the power No or 
Ni or that of the continuum or are finite (see Kuratowski 1933, p. 246.) 

6 See Sierpiriski 1934 an< l 1956. 
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infinite product (in fact the simplest non-trivial one that can be formed). 
There is, however, not one infinite product (of factors > 1) for which so 
much as an upper bound for its value can be assigned. All one knows 
about the evaluation of infinite products are two lower bounds due to Can¬ 
tor and Konig (the latter of which implies the aforementioned negative 
theorem on the power of | the continuum), and some theorems concerning 261 
the reduction of products with different factors to exponentiations and of 
exponentiations to exponentiations with smaller bases or exponents. These 
theorems reduce 7 the whole problem of computing infinite products to the 
evaluation of and the performance of certain fundamental opera¬ 

tions on ordinal numbers, such as determining the limit of a series of them. 

All products and powers can easily be computed 8 if the “generalized con¬ 
tinuum hypothesis” is assumed, i.e., if it is assumed that 2 N " = M a+1 for 
every a, or, in other terms, that the number of subsets of a set of power M Q 
is M a+1 . But, without making any undemonstrated assumption, it is not 
even known whether or not m < n implies 2 TO < 2 n (although it is trivial 
that it implies 2 m < 2"), nor even whether 2 N ° < 2** 1 . 

3. Restatement of the problem on the basis 
of an analysis of the foundations of set theory 
and results obtained along these lines 

This scarcity of results, even as to the most fundamental questions in 
this field, to some extent may be due to purely mathematical difficulties; it 
seems, however (see Section 4), that there are also deeper reasons involved 
and that a complete solution of these problems can be obtained only by 
a more profound analysis (than mathematics is accustomed to giving) of 
the meanings of the terms occurring in them (such as “set”, “one-to-one 
correspondence”, etc.) and of the axioms underlying their use. Several 
such analyses have already been proposed. Let us see then what they give 
for our problem. 

First of all there is Brouwer’s intuitionism, which is utterly destructive 
in its results. The whole theory of the M’s greater than Mi is rejected as 
meaningless. 9 Cantor’s conjecture itself receives several different meanings, 
all of which, though very interesting in themselves, are quite different from 


n be effected, owing to the results and methods of Tarski 1925. 


8 For regular numbers N 


9 See Brouwer 1909. 
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the original problem. They lead partly to affirmative, partly to negative 
answers . 10 Not everything in this field, however, has been sufficiently clar¬ 
ified. The “semi-intuitionistic” standpoint along the lines of H. Poincare 
and H. Weyl 11 would hardly preserve substantially more of set theory. 

262 | However, this negative attitude toward Cantor’s set theory, and toward 

classical mathematics, of which it is a natural generalization, is by no 
means a necessary outcome of a closer examination of their foundations, 
but only the result of a certain philosophical conception of the nature of 
mathematics, which admits mathematical objects only to the extent to 
which they are interpretable as our own constructions or, at least, can be 
completely given in mathematical intuition. For someone who considers 
mathematical objects to exist independently of our constructions and of 
our having an intuition of them individually, and who requires only that 
the general mathematical concepts must be sufficiently clear for us to be 
able to recognize their soundness and the truth of the axioms concerning 
them, there exists, I believe, a satisfactory foundation of Cantor’s set theory 
in its whole original extent and meaning, namely, axiomatics of set theory 
interpreted in the way sketched below. 

It might seem at first that the set-theoretical paradoxes would doom to 
failure such an undertaking, but closer examination shows that they cause 
no trouble at all. They are a very serious problem, not for mathematics, 
however, but rather for logic and epistemology. As far as sets occur in 
mathematics (at least in the mathematics of today, including all of Can¬ 
tor’s set theory), they are sets of integers, or of rational numbers (i.e., of 
pairs of integers), or of real numbers (i.e., of sets of rational numbers), or of 
functions of real numbers (i.e., of sets of pairs of real numbers), etc. When 
theorems about all sets (or the existence of sets in general) are asserted, 
they can always be interpreted without any difficulty to mean that they 
hold for sets of integers as well as for sets of sets of integers, etc. (respec¬ 
tively, that there either exist sets of integers, or sets of sets of integers, 
or ... etc., which have the asserted property). This concept of set , 12 how- 


10 See Brouwer 1907, I, 9; III, 2. 

11 See Weyl 1932. If the procedure of construction of sets described there (p. 20) 
is iterated a sufficiently large (transfinite) number of times, one gets exactly the real 
numbers of the model for set theory mentioned in Section 4, in which the continuum 
hypothesis is true. But this iteration is not possible within the limits of the semi- 
intuitionistic standpoint. 

12 It must be admitted that the spirit of the modern abstract disciplines of mathemat¬ 
ics, in particular of the theory of categories, transcends this concept of set, as becomes 
apparent, e.g., by the self-applicability of categories (see Mac Lane 1961). It does not 
seem, however, that anything is lost from the mathematical content of the theory if cat¬ 
egories of different levels are distinguished. If there existed mathematically interesting 
proofs that would not go through under this interpretation, then the paradoxes of set 
theory would become a serious problem for mathematics. 
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ever, according to which a set is something obtainable from the integers (or 
some other well-defined objects) by iterated application 13 of the operation 
“set of”, 14 not something obtained by | dividing the totality of all existing 263 
things into two categories, has never led to any antinomy whatsoever; that 
is, the perfectly “naive” and uncritical working with this concept of set has 
so far proved completely self-consistent. 15 

But, furthermore, the axioms underlying the unrestricted use of this con¬ 
cept of set or, at least, a subset of them which suffices for all mathematical 
proofs devised up to now (except for theorems depending on the existence 
of extremely large cardinal numbers, see footnote 20), have been formu¬ 
lated so precisely in axiomatic set theory 16 that the question of whether 
some given proposition follows from them can be transformed, by means 
of mathematical logic, into a purely combinatorial problem concerning the 
manipulation of symbols which even the most radical intuitionist must ac¬ 
knowledge as meaningful. So Cantor’s continuum problem, no matter what 
philosophical standpoint is taken, undeniably retains at least this meaning: 
to find out whether an answer, and if so which answer, can be derived from 
the axioms of set theory as formulated in the systems cited. 

Of course, if it is interpreted in this way, there are (assuming the consis¬ 
tency of the axioms) a priori three possibilities for Cantor’s conjecture: It 
may be demonstrable, disprovable, or undecidable. 17 The third alternative 
(which is only a precise formulation of the foregoing conjecture, that the 
difficulties of the problem are probably not purely mathematical) is the 
most likely. To seek a proof for it is, at present, perhaps the most promis¬ 
ing way of attacking the problem. One result along these lines has been 


13 This phrase is meant to include transfinite iteration, i.e., the totality of sets ob¬ 
tained by finite iteration is considered to be itself a set and a basis for further applications 
of the operation “set of”. 

14 The operation “set of x’s” (where the variable “x” ranges over some given kind of 
objects) cannot be defined satisfactorily (at least not in the present state of knowledge), 
but can only be paraphrased by other expressions involving again the concept of set, 
such as: “multitude of x’s”, “combination of any number of x’s”, “part of the totality of 
x’s”, where a “multitude” (“combination”, “part”) is conceived of as something which 
exists in itself no matter whether we can define it in a finite number of words (so that 
random sets are not excluded). 

lo It follows at once from this explanation of the term “set” that a set of all sets 
or other sets of a similar extension cannot exist, since every set obtained in this way 
immediately gives rise to further applications of the operation “set of’ and, therefore, 
to the existence of larger sets. 

16 See, e.g., Bemays 1937, 1941, 1942, 1943, von Neumann 1925\ cf. also von Neu¬ 
mann 1928a and 1929, Godel 1940, Bemays and Fraenkel 1958. By including very 
strong axioms of infinity, much more elegant axiomatizations have recently become pos¬ 
sible. (See Bemays 1961.) 

17 In case the axioms were inconsistent the last one of the four a priori possible alter¬ 
natives for Cantor’s conjecture would occur, namely, it would then be both demonstrable 
and disprovable by the axioms of set theory. 
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obtained already, namely, that Cantor’s conjecture is not disprovable from 
the axioms of set theory, provided that these axioms are consistent (see 
Section 4). 

It is to be noted, however, that on the basis of the point of view here 
adopted, a proof of the undecidability of Cantor’s conjecture from the ac¬ 
cepted axioms of set theory (in contradistinction, e.g., to the proof of the 
transcendency of i r) would by no means solve the problem. For if the mean¬ 
ings of the primitive terms of set theory as explained on page 262 and in 
footnote 14 are accepted as sound, it follows that the set-theoretical con¬ 
cepts and theorems describe some well-determined reality, in which Can- 
264 tor’s conjecture | must be either true or false. Hence its undecidability from 
the axioms being assumed today can only mean that these axioms do not 
contain a complete description of that reality. Such a belief is by no means 
chimerical, since it is possible to point out ways in which the decision of a 
question, which is undecidable from the usual axioms, might nevertheless 
be obtained. 

First of all the axioms of set theory by no means form a system closed 
in itself, but, quite on the contrary, the very concept of set 18 on which 
they are based suggests their extension by new axioms which assert the 
existence of still further iterations of the operation “set of”. These axioms 
can be formulated also as propositions asserting the existence of very great 
cardinal numbers (i.e., of sets having these cardinal numbers). The simplest 
of these strong “axioms of infinity” asserts the existence of inaccessible 
numbers (in the weaker or stronger sense) > Mo- The latter axiom, roughly 
speaking, means nothing else but that the totality of sets obtainable by 
use of the procedures of formation of sets expressed in the other axioms 
forms again a set (and, therefore, a new basis for further applications of 
these procedures). 19 Other axioms of infinity have first been formulated 
by P. Mahlo. 20 These axioms show clearly, not only that the axiomatic 


18 Similarly the concept “property of set” (the second of the primitive terms of set the¬ 
ory) suggests continued extensions of the axioms referring to it. Furthermore, concepts 
of “property of property of set” etc. can be introduced. The new axioms thus obtained, 
however, as to their consequences for propositions referring to limited domains of sets 
(such as the continuum hypothesis) are contained (as far as they are known today) in 
the axioms about sets. 

19 See Zermelo 1930. 

i0 [Revised note of September 1966: See Mahlo 1911, pp. 190-200, and 1913, pp. 269- 
276. From Mahlo’s presentation of the subject, however, it does not appear that the 
numbers he defines actually exist. In recent years great progress has been made in the 
area of axioms of infinity. In particular, some propositions have been formulated which, 
if consistent, are extremely strong axioms of infinity of an entirely new kind (see Keisler 
and Tarski 1964 and the material cited there). Dana Scott (1961) has proved that one 
of them implies the existence of non-constructible sets. That these axioms are implied 
by the general concept of set in the same sense as Mahlo’s has not been made clear 
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system of set theory as used today is incomplete, but also that it can be 
supplemented without arbitrariness by new axioms which only unfold the 
content of the concept of set explained above. 

It can be proved that these axioms also have consequences far outside 
the domain of very great transfinite numbers, which is their immediate 
subject matter: each of them, under the assumption of its consistency, can 
be shown to increase the number of decidable propositions even in the field 
of Diophantine equations. As for the continuum problem, there is little 
hope of | solving it by means of those axioms of infinity which can be set 265 
up on the basis of Mahlo’s principles (the aforementioned proof for the 
undisprovability of the continuum hypothesis goes through for all of them 
without any change). But there exist others based on different principles 
(see footnote 20); also there may exist, besides the usual axioms, the ax¬ 
ioms of infinity, and the axioms mentioned in footnote 18, other (hitherto 
unknown) axioms of set theory which a more profound understanding of the 
concepts underlying logic and mathematics would enable us to recognize 
as implied by these concepts (see, e.g., footnote 23). 

Secondly, however, even disregarding the intrinsic necessity of some new 
axiom, and even in case it has no intrinsic necessity at all, a probable de¬ 
cision about its truth is possible also in another way, namely, inductively 
by studying its “success”. Success here means fruitfulness in consequences, 
in particular in “verifiable” consequences, i.e., consequences demonstrable 
without the new axiom, whose proofs with the help of the new axiom, how¬ 
ever, are considerably simpler and easier to discover, and make it possible 
to contract into one proof many different proofs. The axioms for the sys¬ 
tem of real numbers, rejected by the intuitionists, have in this sense been 
verified to some extent, owing to the fact that analytical number theory fre¬ 
quently allows one to prove number-theoretical theorems which, in a more 
cumbersome way, can subsequently be verified by elementary methods. A 
much higher degree of verification than that, however, is conceivable. There 
might exist axioms so abundant in their verifiable consequences, shedding 
so much light upon a whole field, and yielding such powerful methods for 
solving problems (and even solving them constructively, as far as that is 
possible) that, no matter whether or not they are intrinsically necessary, 
they would have to be accepted at least in the same sense as any well- 
established physical theory. 


yet (see Tarski 1962 , p. 134). However, they are supported by strong arguments from 
analogy, e.g., by the fact that they follow from the existence of generalizations of Stone’s 
representation theorem to Boolean algebras with operations on infinitely many elements. 
Mahlo’s axioms of infinity have been derived from a general principle about the totality 
of sets which was first introduced by A. Levy (1960). It gives rise to a hierarchy of 
different precise formulations. One, given by P. Bernays (1961), implies all of Mahlo’s 
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4. Some observations about the question: 

In what sense and in which direction may a solution 
of the continuum problem be expected? 

But are such considerations appropriate for the continuum problem? 
Are there really any clear indications for its unsolvability by the accepted 
axioms? I think there are at least two: 

The first results from the fact that there are two quite differently defined 
classes of objects both of which satisfy all axioms of set theory that have 
been set up so far. One class consists of the sets definable in a certain 
manner by properties of their elements; 21 the other of the sets in the sense 
of arbitrary multitudes, regardless of if, or how, they can be defined. Now, 
266 before it has | been settled what objects are to be numbered, and on the 
basis of what one-to-one correspondences, one can hardly expect to be able 
to determine their number, except perhaps in the case of some fortunate 
coincidence. If, however, one believes that it is meaningless to speak of sets 
except in the sense of extensions of definable properties, then, too, he can 
hardly expect more than a small fraction of the problems of set theory to be 
solvable without making use of this, in his opinion essential, characteristic 
of sets, namely, that they axe extensions of definable properties. This char¬ 
acteristic of sets, however, is neither formulated explicitly nor contained 
implicitly in the accepted axioms of set theory. So from either point of 
view, if in addition one takes into account what was said in Section 2, it 
may be conjectured that the continuum problem cannot be solved on the 
basis of the axioms set up so far, but, on the other hand, may be solvable 
with the help of some new axiom which would state or imply something 
about the definability of sets. 22 

The latter half of this conjecture has already been verified; namely, the 
concept of definability mentioned in footnote 21 (which itself is definable 
in axiomatic set theory) makes it possible to derive, in axiomatic set the¬ 
ory, the generalized continuum hypothesis from the axiom that every set 
is definable in this sense. 23 Since this axiom (let us call it “A”) turns 


21 Namely, definable by certain procedures, “in terms of ordinal numbers” (i.e., 
roughly speaking, under the assumption that for each ordinal number a symbol de¬ 
noting it is given). See my papers 1939a and 1940. The paradox of Richard, of course, 
does not apply to this kind of definability, since the totality of ordinals is certainly not 
denumerable. 

22 D. Hilbert’s program for a solution of the continuum problem (see his 1926), which, 
however, has never been carried through, also was based on a consideration of all possible 
definitions of real numbers. 

23 On the other hand, from an axiom in some sense opposite to this one, the negation 
of Cantor’s conjecture could perhaps be derived. I am thinking of an axiom which 
(similar to Hilbert’s completeness axiom in geometry) would state some maximum 
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out to be demonstrably consistent with the other axioms, under the as¬ 
sumption of the consistency of these other axioms, this result (regardless 
of the philosophical position taken toward definability) shows the consis¬ 
tency of the continuum hypothesis with the axioms of set theory, provided 
that these axioms themselves are consistent. 24 This proof in its structure 
is similar to the consistency proof of non-Euclidean geometry by means of 
a model within Euclidean geometry. Namely, it follows from the axioms of 
set theory that the sets definable in the aforementioned sense form a model 
of set theory in which the proposition A and, therefore, the generalized 
continuum hypothesis is true. 

A second argument in favor of the unsolvability of the continuum prob¬ 
lem on the basis of the usual axioms can be based on certain facts (not 
known at Cantor’s time) which seem to indicate that Cantor’s conjecture 
will turn out | to be wrong, 25 while, on the other hand, a disproof of it is 267 
demonstrably impossible on the basis of the axioms being assumed today. 

One such fact is the existence of certain properties of point sets (asserting 
an extreme rareness of the sets concerned) for which one has succeeded in 
proving the existence of non-denumerable sets having these properties, but 
no way is apparent in which one could expect to prove the existence of 
examples of the power of the continuum. Properties of this type (of subsets 
of a straight line) are: (1) being of the first category on every perfect set, 26 
(2) being carried into a zero set by every continuous one-to-one mapping 
of the line onto itself. 27 Another property of a similar nature is that of 
being coverable by infinitely many intervals of any given lengths. But in 
this case one has so far not even succeeded in proving the existence of 
non-denumerable examples. From the continuum hypothesis, however, it 
follows in all three cases that there exist, not only examples of the power 
of the continuum, 28 but even such as are carried into themselves (up to 
denumerably many points) by every translation of the straight line. 29 

Other highly implausible consequences of the continuum hypothesis are 
that there exist: (1) subsets of a straight line of the power of the contin¬ 
uum which are covered (up to denumerably many points) by every dense set 


property of the system of all sets, whereas axiom A states a minimum property. Note that 
only a maximum property would seem to harmonize with the concept of set explained 
in footnote 14. 

24 See my monograph 1940 and my paper 1939a. For a carrying through of the proof 
in all details, my 1940 is to be consulted. 

25 Views tending in this direction have been expressed also by N. Luzin in his 1935, 
pp. 129 ff. See also Sierpinski 1935. 

26 See Sierpinski 1934a and Kuratowski 1933, pp. 269 ff. 

27 See Luzin and Sierpinski 1918 and Sierpinski 1934a. 

28 For the third case see Sierpinski 1934, P- 39, Theorem 1. 

29 See Sierpinski 1935a. 
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of intervals; 30 (2) infinite-dimensional subsets of Hilbert space which con¬ 
tain no non-denumerable finite-dimensional subset (in the sense of Menger- 
Urysohn); 31 (3) an infinite sequence A 1 of decompositions of any set M of 
the power of the continuum into continuum-many mutually exclusive sets 
A l x such that, in whichever way a set A l x . is chosen for each i, II^(Af — A l x .) 
is denumerable. 32 (1) and (3) are very implausible even if “power of the 
continuum” is replaced by “Ni”. 

One may say that many results of point-set theory obtained without us¬ 
ing the continuum hypothesis also are highly unexpected and implausible. 33 
But, true as that may be, still the situation is different there, in that, in 
most of those instances (such as, e.g., Peano’s curves) the appearance to 
the contrary can be explained by a lack of agreement between our intuitive 
geometrical concepts and the set-theoretical ones occurring in the theo- 
268 rems. Also, it is very | suspicious that, as against the numerous plausible 
propositions which imply the negation of the continuum hypothesis, not 
one plausible proposition is known which would imply the continuum hy¬ 
pothesis. I believe that adding up all that has been said one has good 
reason for suspecting that the role of the continuum problem in set theory 
will be to lead to the discovery of new axioms which will make it possible 
to disprove Cantor’s conjecture. 


Definitions of some of the technical terms 

Definitions 4-15 refer to subsets of a straight line, but can be literally 
transferred to subsets of Euclidean spaces of any number of dimensions if 
“interval” is identified with “interior of a parallelepipedon”. 

1. I call the character of cofinality of a cardinal number m (abbreviated 
by “cf(m)”) the smallest number n such that m is the sum of n 
numbers < m. 

2. A cardinal number m is regular if cf(m) = m, otherwise singular. 

3. An infinite cardinal number m is inaccessible if it is regular and has 
no immediate predecessor (i.e., if, although it is a limit of numbers 
< m, it is not a limit of fewer than m such numbers); it is strongly 
inaccessible if each product (and, therefore, also each sum) of fewer 
than m numbers < m is < m. (See Sierpinski and Tarski 1930 , Tarski 
1938.) 


30 See Luzin 1914, P- 1259. 

31 See Hurewicz 1932. 

32 See Braun and Sierpinski 1932, p. 1, proposition (Q). This proposition is equiva¬ 
lent with the continuum hypothesis. 

33 See, e.g., Blumenthal 1940. 
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It follows from the generalized continuum hypothesis that these 
two concepts are equivalent. H 0 is evidently inaccessible, and also 
strongly inaccessible. As for finite numbers, 0 and 2 and no others 
are strongly inaccessible. A definition of inaccessibility, applicable to 
finite numbers, is this: m is inaccessible if (1) any sum of fewer than m 
numbers < m is < m, and (2) the number of numbers < to is to. This 
definition, for transfinite numbers, agrees with that given above and, 
for finite numbers, yields 0,1, 2 as inaccessible. So inaccessibility and 
strong inaccessibility turn out not to be equivalent for finite numbers. 
This casts some doubt on their equivalence for transfinite numbers, 
which follows from the generalized continuum hypothesis. 

4. A set of intervals is dense if every interval has points in common 
with some interval of the set. (The endpoints of an interval are not 
considered as points of the interval.) 

5. A zero set is a set which can be covered by infinite sets of intervals 
with arbitrarily small lengths-sum. 

6. A neighborhood of a point P is an interval containing P. 

7. A subset A of B is dense in B if every neighborhood of any point of 
B contains points of A. 

8. A point P is in the exterior of A if it has a neighborhood containing 
no point of A. 

9. A subset A of B is nowhere dense in B if those points of B which are 
in the exterior of A are dense in B , or (which is equivalent) if for no 
interval I the intersection IA is dense in IB. 

10. A subset A of B is of the first category in B if it is the sum of 
denumerably many sets nowhere dense in B. 

11. A set A is of the first category on B if the intersection AB is of the 
first category in B. 

12. A point P is called a limit point of a set A if any neighborhood of P 
contains infinitely many points of A. 

13. A set A is called closed if it contains all its limit points. 

14. A set is perfect if it is closed and has no isolated point (i.e., no point 
with a neighborhood containing no other point of the set). 

15. Borel sets are defined as the smallest system of sets satisfying the 
postulates: 

(1) The closed sets are Borel sets. 

(2) The complement of a Borel set is a Borel set. 

(3) The sum of denumerably many Borel sets is a Borel set. 

16. A set is analytic if it is the orthogonal projection of some Borel set 
of a space of next higher dimension. (Every Borel set therefore is, of 
course, analytic.) 
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Supplement to the second edition 

Since the publication of the preceding paper, a number of new results 
have been obtained; I would like to mention those that are of special interest 
in connection with the foregoing discussions. 

1. A. Hajnal has proved 34 that, if 2**° / H2 could be derived from the 
axioms of set theory, so could 2 N ° = Mj. This surprising result could greatly 
facilitate the solution of the continuum problem, should Cantor’s contin¬ 
uum hypothesis be demonstrable from the axioms of set theory, which, 
however, probably is not the case. 

2. Some new consequences of, and propositions equivalent with, Can¬ 
tor’s hypothesis can be found in the new edition of W. Sierpinski’s book. 35 
In the first edition, it had been proved that the continuum hypothesis is 
equivalent with the proposition that the Euclidean plane is the sum of de- 
numerably many “generalized curves” (where a generalized curve is a point 

270 set definable | by an equation y = f(x) in some Cartesian coordinate sys¬ 
tem) . In the second edition, it is pointed out 36 that the Euclidean plane can 
be proved to be the sum of fewer than continuum-many generalized curves 
under the much weaker assumption that the power of the continuum is not 
an inaccessible number. A proof of the converse of this theorem would give 
some plausibility to the hypothesis 2 K ° = the smallest inaccessible number 
> Mo- However, great caution is called for with regard to this inference, 36a 
because the paradoxical appearance in this case (like in Peano’s “curves”) is 
due (at least in part) to a transference of our geometrical intuition of curves 
to something which has only some of the characteristics of curves. Note 
that nothing of this kind is involved in the counterintuitive consequences 
of the continuum hypothesis mentioned on page 267. 

3. C. Kuratowski has formulated a strengthening of the continuum hypo¬ 
thesis, 37 whose consistency follows from the consistency proof mentioned 
in Section 4. He then drew various consequences from this new hypothesis. 

4. Very interesting new results about the axioms of infinity have been 
obtained in recent years (see footnotes 20 and 16). 

In opposition to the viewpoint advocated in Section 4 it has been sug¬ 
gested 38 that, in case Cantor’s continuum problem should turn out to be 


34 See his 1956. 

36 See Sierpinski 1956. 

36 See his 1956 , p. 207 or his 1951, p. 9. Related results are given by C. Kuratowski 
(1951, p. 15) and R. Sikorski (1951). 

36a -[Note added September 1966: It seems that this warning has since been vindicated 
by Roy O. Davies (1963).] 

37 See his 1948. 

38 See Errera 1952. 
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undecidable from the accepted axioms of set theory, the question of its 
truth would lose its meaning, exactly as the question of the truth of Euclid’s 
fifth postulate by the proof of the consistency of non-Euclidean geometry 
became meaningless for the mathematician. I therefore would like to point 
out that the situation in set theory is very different from that in geometry, 
both from the mathematical and from the epistemological point of view. 

In the case of the axiom of the existence of inaccessible numbers, e.g., 
(which can be proved to be undecidable from the von Neumann-Bernays 
axioms of set theory provided that it is consistent with them) there is 
a striking asymmetry, mathematically, between the system in which it is 
asserted and the one in which it is negated. 39 

Namely, the latter (but not the former) has a model which can be de¬ 
fined and proved to be a model in the original (unextended) system. This 
means that the former is an extension in a much stronger sense. A closely 
related fact is that the assertion (but not the negation) of the axiom im¬ 
plies new theorems about integers (the individual instances of which can 
be verified by computation). So the criterion of truth explained on page 
264 is satisfied, to some extent, for the assertion, but not for the negation. 
Briefly speaking, only the assertion | yields a “fruitful” extension, while the 271 
negation is sterile outside its own very limited domain. The generalized 
continuum hypothesis, too, can be shown to be sterile for number theory 
and to be true in a model constructible in the original system, whereas for 
some other assumption about the power of 2 Nq this perhaps is not so. On 
the other hand, neither one of those asymmetries applies to Euclid’s fifth 
postulate. To be more precise, both it and its negation are extensions in 
the weak sense. 

As far as the epistemological situation is concerned, it is to be said that 
by a proof of undecidability a question loses its meaning only if the system 
of axioms under consideration is interpreted as a hypothetico-deductive 
system, i.e., if the meanings of the primitive terms are left undetermined. 

In geometry, e.g., the question as to whether Euclid’s fifth postulate is true 
retains its meaning if the primitive terms are taken in a definite sense, i.e., 
as referring to the behavior of rigid bodies, rays of light, etc. The situation 
in set theory is similar; the difference is only that, in geometry, the meaning 
usually adopted today refers to physics rather than to mathematical intu¬ 
ition and that, therefore, a decision falls outside the range of mathematics. 

On the other hand, the objects of transfinite set theory, conceived in the 
manner explained on page 262 and in footnote 14, clearly do not belong to 
the physical world, and even their indirect connection with physical experi¬ 
ence is very loose (owing primarily to the fact that set-theoretical concepts 
play only a minor role in the physical theories of today). 

39 The same asymmetry also occurs on the lowest levels of set theory, where the 
consistency of the axioms in question is less subject to being doubted by skeptics. 
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But, despite their remoteness from sense experience, we do have some¬ 
thing like a perception also of the objects of set theory, as is seen from the 
fact that the axioms force themselves upon us as being true. I don’t see any 
reason why we should have less confidence in this kind of perception, i.e., in 
mathematical intuition, than in sense perception, which induces us to build 
up physical theories and to expect that future sense perceptions will agree 
with them, and, moreover, to believe that a question not decidable now 
has meaning and may be decided in the future. The set-theoretical para¬ 
doxes are hardly any more troublesome for mathematics than deceptions 
of the senses are for physics. That new mathematical intuitions leading to 
a decision of such problems as Cantor’s continuum hypothesis are perfectly 
possible was pointed out earlier (pages 264-265). 

It should be noted that mathematical intuition need not be conceived 
of as a faculty giving an immediate knowledge of the objects concerned. 
Rather it seems that, as in the case of physical experience, we form our 
ideas also of those objects on the basis of something else which is imme¬ 
diately given. Only this something else here is not, or not primarily, the 
sensations. That something besides the sensations actually is immediately 
given follows (independently of mathematics) from the fact that even our 
ideas referring to physical objects contain constituents qualitatively differ¬ 
ent from sensations or mere combinations of sensations, e.g., the idea of 
object itself, whereas, on the other hand, by our thinking we cannot create 
272 any qualitatively new elements, but only | reproduce and combine those 
that are given. Evidently the “given” underlying mathematics is closely 
related to the abstract elements contained in our empirical ideas. 40 It by 
no means follows, however, that the data of this second kind, because they 
cannot be associated with actions of certain things upon our sense organs, 
are something purely subjective, as Kant asserted. Rather they, too, may 
represent an aspect of objective reality, but, as opposed to the sensations, 
their presence in us may be due to another kind of relationship between 
ourselves and reality. 

However, the question of the objective existence of the objects of math¬ 
ematical intuition (which, incidentally, is an exact replica of the question 
of the objective existence of the outer world) is not decisive for the prob¬ 
lem under discussion here. The mere psychological fact of the existence of 
an intuition which is sufficiently clear to produce the axioms of set theory 
and an open series of extensions of them suffices to give meaning to the 
question of the truth or falsity of propositions like Cantor’s continuum hy¬ 
pothesis. What, however, perhaps more than anything else, justifies the 


40 Note that there is a close relationship between the concept of set explained in 
footnote 14 and the categories of pure understanding in Kant’s sense. Namely, the 
function of both is “synthesis”, i.e., the generating of unities out of manifolds (e.g., in 
Kant, of the idea of one object out of its various aspects). 
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acceptance of this criterion of truth in set theory is the fact that contin¬ 
ued appeals to mathematical intuition are necessary not only for obtaining 
unambiguous answers to the questions of transfinite set theory, but also 
for the solution of the problems of finitary number theory 41 (of the type 
of Goldbach’s conjecture), 42 where the meaningfulness and unambiguity 
of the concepts entering into them can hardly be doubted. This follows 
from the fact that for every axiomatic system there are infinitely many 
undecidable propositions of this type. 

It was pointed out earlier (page 265) that, besides mathematical intu¬ 
ition, there exists another (though only probable) criterion of the truth of 
mathematical axioms, namely their fruitfulness in mathematics and, one 
may add, possibly also in physics. This criterion, however, though it may 
become decisive in the future, cannot yet be applied to the specifically 
set-theoretical axioms (such as those referring to great cardinal numbers), 
because very little is known about their consequences in other fields. The 
simplest case of an application of the criterion under discussion arises when 
some set-theoretical axiom has number-theoretical consequences verifiable 
by computation up to any given integer. On the basis of what is known 
today, however, it is not possible to make the truth of any set-theoretical 
axiom reasonably probable in this manner. 


| Postscript 273 

[Revised postscript of September 1966: Shortly after the completion 
of the manuscript of the second edition [19641 °f this paper the ques¬ 
tion of whether Cantor’s continuum hypothesis is decidable from the von 
Neumann-Bernays axioms of set theory (the axiom of choice included) was 
settled in the negative by Paul J. Cohen. A sketch of the proof has ap¬ 
peared in his 1963 and 1964 • It turns out that for all tt T defined by the 
usual devices and not excluded by Konig’s theorem (see page 260 above) 
the equality 2 N ° = H r is consistent and an extension in the weak sense (i.e., 
it implies no new number-theoretical theorem). Whether, for a satisfac¬ 
tory concept of “standard definition”, this is true for all definable K T not 
excluded by Konig’s theorem is an open question. An affirmative answer 
would require the solution of the difficult problem of making the concept of 
standard definition, or some wider concept, precise. Cohen’s work, which 


41 Unless one is satisfied with inductive (probable) decisions, such as verifying the 
theorem up to very great numbers, or more indirect inductive procedures (see pp. 265, 
272). 

42 I.e., universal propositions about integers which can be decided in each individual 
instance. 
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no doubt is the greatest advance in the foundations of set theory since 
its axiomatization, has been used to settle several other important inde¬ 
pendence questions. In particular, it seems to follow that the axioms of 
infinity mentioned in footnote 20, to the extent to which they have so far 
been precisely formulated, are not sufficient to answer the question of the 
truth or falsehood of Cantor’s continuum hypothesis.] 




On an extension of finitary mathematics which 
has not yet been used a 
{ 1972 ) 


[The introductory note to 1972, as well as to related items, is found on 
page 217, immediately preceding 1958^ 


Abstract 


P. Bernays has pointed out that, even in order to prove only the consistency of 
classical number theory, it is necessary to extend Hilbert’s finitary standpoint. He 
suggested admitting certain abstract concepts in addition to the combinatorial concepts 
referring to symbols. The abstract concepts that so far have been used for this purpose 
are those of the constructive theory of ordinals and those of intuitionistic logic. It 
is shown that a certain concept of computable function of finite simple type over the 
natural numbers can be used instead, where no other procedures of constructing such 
functions are necessary except primitive recursion by a number variable and definition 
of a function by an equality with a term containing only variables and/or previously 
introduced functions beginning with the function +1. 


P. Bernays has pointed out 1 on several occasions that, in view of the fact 
that the consistency of a formal system cannot be proved by any deduc¬ 
tion procedures available in the system itself, it is necessary to go beyond 
the framework of finitary mathematics in Hilbert’s sense in order to prove 
the consistency of classical mathematics or even of classical number the- 


1 See: Bernays 1941a, pp. 144, 147, 150, 152; Hilbert and Bernays 1989, pp. 347-349, 
357-360; Bernays 1954, P- 9; cf. also Bernays 1935, pp. 62, 69. 


a The present paper is not a literal translation of the German original published 
in Dialectica (1958). In revising the translation by Leo F. Boron, I have rephrased 
many passages. But the meaning has nowhere been substantially changed. Some minor 
inaccuracies have been corrected and a number of notes have been added, to which the 
letters (a)-(n) refer. I wish to express my best thanks to Professor Dana Scott for 
supervising the typing of this and the subsequent paper ll972aj while I was ill, and to 
Professor Paul Bernays for reading the proof sheets and calling my attention to some 
oversights in the manuscript. 
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ory. Since finitary mathematics is defined 2 as the mathematics of concrete 
intuition, this seems to imply that abstract concepts are needed for the 
proof of consistency of number theory. c An extension of finitism by such 
concepts was explicitly suggested by Bernays in his 1935 , page 69. By ab¬ 
stract concepts, in this context, are meant concepts which are essentially 
of the second or higher level, i.e., which do not have as their content prop¬ 
erties or relations of concrete objects (such as combinations of symbols), but 


2 See Hilbert’s explanation in his 1926, pp. 170-173. b 


b “Concrete intuition”, “concretely intuitive are used as translations of “Anschau- 
ung”, “anschaulich”. The simple terms “ concrete ” or “ intuitive ” are also used in this 
sense in the present paper. What Hilbert means by “Anschauung” is substantially 
Kant’s space-time intuition confined, however, to configurations of a finite number of 
discrete objects. Note that it is Hilbert’s insistence on concrete knowledge that makes 
finitary mathematics so surprisingly weak and excludes many things that are just as in- 
controvertibly evident to everybody as finitary number theory. E.g., while any primitive 
recursive definition is finitary, the general principle of primitive recursive definition is 
not a finitary proposition, because it contains the abstract concept of function. There is 
nothing in the term “finitary” which would suggest a restriction to concrete knowledge. 
Only Hilbert’s special interpretation of it introduces this restriction. 

c Accessibility and some closely related concepts (combined with intuitionistic logic) 
are those that have been used most of all in consistency proofs (see: 1. Gentzen 1936 , 
pp. 555, 558; 2. Lorenzen 1951, p. 99, in particular his “induction of the second kind”; 
3. Schiitte 1954, P- 31; 4. Kreisel 1965, p. 137, 1967, p. 246, and 1968, p. 351, §12; 
5. Takeuti 1957, 1960, 1967.) These concepts create the deceptive impression of being 
based on a concrete intuition of certain infinite procedures, such as “counting beyond 
tv” or “running through” the ordinals smaller than an ordinal a. We do have such an 
intuition, but it does not reach very far in the series of ordinals, certainly no farther 
than finitism. In order to make the concept of accessibility fruitful, abstract conceptions 
are always necessary, e.g., insights about infinitely many possible insights in Gentzen’# 
original definition, which is somewhat different from that given above (see his 1936, p. 
555, line 7). A closer approximation to Hilbert’s finitism can be achieved by using the 
concept of free choice sequences instead of “accessibility” . d 

d This is really an abstract principle about schemes of ramification, which, however, 
by Brouwer and Heyting is stated and proved only for the case that their elements 
are integers (although it is not clear that this fact is substantially used in the proof). 
C. Spector in his 1962 has shown that the abstract principle implies the consistency 
of classical analysis, while Brouwer’s principle yields only the consistency of a certain 
subsystem of it. Unfortunately, however, no satisfactory constructivistic proof is known 
for either one of the two principles (except that, according to G. Kreisel (1965, p. 143), 
the weaker principle can be proved relatively consistent with the other accepted axioms 
of intuitionism). It was G. Kreisel who first suggested using this principle for consistency 
proofs. 

Perhaps the most promising extension of the system T is that obtained by introducing 
higher-type computable functions of constructive ordinals. 




On an extension of finitary mathematics 


273 


rather of thought structures or thought contents (e.g., proofs, meaningful 
propositions, and so on), where in the proofs of propositions about these 
mental objects insights are needed which are not derived from a reflection 
upon the combinatorial (space-time) properties of the symbols representing 
them, but rather from a reflection upon the meanings involved.® 

Due to the lack of a precise definition of either concrete or abstract ev¬ 
idence there exists, today, no rigorous proof for the insufficiency (even for 
the consistency proof of number theory) of finitary mathematics. However, 
this surprising fact has been made abundantly clear through the exami¬ 
nation of induction by eo used in Gentzen’s consistency proof of number 
theory. The situation may roughly be described as follows: Recursion for 
e 0 could be proved finitarily if the consistency of number theory could. On 
the other hand the validity of this recursion can certainly not be made 
immediately evident, as is possible for example in the case of uj 2 . That is 
to say, one cannot grasp at one glance the various structural possibilities 
which exist for decreasing sequences, and there exists, therefore, no imme¬ 
diate concrete knowledge of the termination of every such sequence. But 
furthermore such concrete knowledge (in Hilbert’s sense) cannot be realized 
either by a stepwise transition from smaller to larger ordinal numbers, be¬ 
cause the concretely evident steps, such as a —> o? , are so small that they 
would have to be repeated eo times in order to reach eo- The same is true of 
chains of other concretely evident inferences which one may try to use, e.g., 
Hilbert’s cj-rule to the extent to which it is concretely evident. What can 
be accomplished is only an abstract knowledge based on concepts of higher 
level, e.g., on “accessibility”. This concept can be defined by the fact that 
the validity of induction is constructively demonstrable for the ordinal in 
question. 3 Whether the necessity of abstract concepts for the proof of in¬ 
duction from a certain point on in the series of constructive ordinals is due 
solely to the impossibility of grasping intuitively the complicated (though 


3 W. Ackermann in his 1951, p. 407, says that “accessible” has a concrete meaning if 
demonstrability is understood as formal provability according to certain rules. However, 
it is to be noted that from this concrete fact the validity of the rule of transfinite 
induction applied to a given property follows only with the help of abstract concepts, 
or with the help of transfinite induction in metamathematics. But it is true that, the 
concept of “accessible”, at least for induction up to eo, can be replaced by weaker 
abstract concepts (see Hilbert and Bemays 1939, p. 363ff.); see footnote c. 


e An example is the concept “p implies q ” in the sense of: “From a convincing proof 
of p a convincing proof of q can be obtained”. 
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only finitely complicated) combinational relations involved, 4 or arises for 
some essential reason, cannot be decided off hand. 

In the second case it must be possible, after making the concepts in 
question precise, to give a rigorous proof for the existence of that necessity. 

At any rate Bernays’ observations in his 1935, footnote 1, teach us to 
distinguish two component parts in the concept of finitary mathematics, 
namely: first, the constructivistic element, which consists in admitting ref¬ 
erence to mathematical objects or facts only in the sense that they can 
be exhibited, or obtained by construction or proof; second, the specifically 
finitistic element, which requires in addition that the objects and facts con¬ 
sidered should be given in concrete mathematical intuition. This, as far 
as the objects are concerned, means that they must be finite space-time 
configurations of elements whose nature is irrelevant except for equality 
or difference. (In contrast to this, the objects in intuitionistic logic are 
meaningful propositions and proofs.) 

It is the second requirement which must be dropped. Until now this 
fact was taken into account by adjoining to finitary mathematics parts of 
intuitionistic logic and of the constructivistic theory of ordinal numbers. 
It will be shown in the sequel that, instead, one can use, for the proof of 
consistency of number theory, a certain concept of a computable function 
of finite type over the natural numbers and some very elementary axioms 
and principles of construction for such functions. 

The concept “computable function of type f” is defined as follows: 
1. The computable functions of type 0 are the natural numbers. 2. If the 


4 Note that an adequate proof-theoretic characterization of concrete intuition, in case 
this faculty is idealized by abstracting from the practical limitation, will include induc¬ 
tion procedures which for us are not concretely intuitive and which could very well yield 
a proof of the inductive inference for eo or larger ordinals. Another possibility of ex¬ 
tending the original finitary viewpoint for which the same comment holds consists in 
considering as finitary any abstract arguments which only reflect (in a combinatorially 
finitary manner) on the content of finitary formalisms constructed before, and iterate 
this reflection transfinitely, using only ordinals constructed in previous stages of this 
process. A formalism based on this idea was given by G. Kreisel at the International 
Congress of Mathematicians in Edinburgh, 1958 (Kreisel 1960)? Note that, if finitism 
is extended in this manner, the abstract element appears in an essentially weaker form 
than in any other extension mentioned in the present paper. 


f An unobjectionable version is given in Kreisel 1965, pp. 168-173, 177-178. Theorem 
3.43 on page 172 of these lectures states that eo is the limit of this process. Kreisel wants 
to conclude from this fact that eo is the exact limit of idealized concrete intuition. But 
his arguments would have to be elaborated further in order to be fully convincing. Note 
that Kreisel’s hierarchy can be extended far beyond eo by considering as one step any 
sequence of steps that has been shown to be admissible (e.g., any sequence of eo steps). 
It then provides a means for making the much used concept of accessibility (see footnote 
c above) constructive in a much stricter sense by resolving the general impredicative 
concept of intuitionistic proof into constructed levels of formal proofs. 
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concepts “computable function of type to”, “computable function of type 
ti", “computable function of type 4” (where k > 1), have already 
been defined, then a computable function of type , /.fc) is defined 

to be a well-defined mathematical procedure which can be applied to any 
fc-tuple of computable functions of types t\, fa ,..., tk, and yields a com¬ 
putable function of type to as result; and for which, moreover, this general 
fact is constructively evident. The phrase “well-defined mathematical pro¬ 
cedure” is to be accepted as having a clear meaning without any further 
explanation. 5 The functions occurring in this hierarchy are called “com¬ 
putable functions of finite type over the natural numbers.” 6 


5 It is well-known that A. M. Turing has given an elaborate definition of the concept 
of a mechanically computable function of natural numbers. This definition most cer¬ 
tainly was not superfluous. However, if the term “mechanically computable” had not 
had a clear, although unanalyzed, meaning before, the question as to whether Turing’s 
definition is adequate would be meaningless, while it undoubtedly has an affirmative 
answer.® 

6 One may doubt that, on the basis of the definition given, we have a sufficiently clear 
idea of the content of this concept, but not that the axioms of the system T given in 
the sequel are valid for it. The same apparently paradoxical situation also exists for the 
concept of an intuitionistically correct proof, which is the basis of intuitionistic logic in 
Heyting’s interpretation. As the subsequent discussion will show, these two concepts can 
replace each other in building up intuitionistic logic within number theory. Of course, if 
this replacement is to have any epistemological significance, the concept of computable 
function used and the insight that these functions satisfy the axioms of T given below 
must not implicitly involve intuitionistic logic or the concept of proof as used by Heyting. 
This condition is satisfied for the concept of “computable of finite type” given in the 
text and footnote h. 


g It is easily seen that Turing’s functions (where functions as arguments or values of 
higher-type functions are to be identified with the code numbers of their machines) and 
certain subclasses of them also satisfy the axioms and rules of the system T given below. 
As to the meaning of this fact, see footnote h. 

h An elaboration of this idea would lead to the following: 

1. A narrower concept of proof, which may be called “reductive proof” and which, 
roughly speaking, is defined by the fact that, up to certain trivial supplementations, 
the chain of definitions of the concepts occurring in the theorem together with certain 
axioms about the primitive terms forms by itself a proof, i.e., an unbroken chain of 
immediate evidences. In special applications (as, e.g., in our case) this concept of proof 
can be made precise by specifying the supplementations, the axioms, and the evidences 
to be used. 

Note that in this context a definition is to be considered as a theorem stating the 
existence and unicity of an object satisfying certain conditions and that, in our case, 
it is convenient that a statement regarding the type character of the function defined 
should form part of its definition. 

2. A more special concept of “computable of type t”, obtained by replacing in 
the definition given above the concept of “constructively evident or demonstrable” 
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As far as axioms and rules of inference for this concept are concerned, 
no others are needed except the following: (1) axioms for the two-valued 
propositional calculus applied to equations between terms of equal type, 
(2) the axioms of equality, 7 i.e., x = x and x = y . D. t(x) = t{y) for vari¬ 
ables x, y and terms t of any type, (3) the third and fourth Peano axioms, 
i.e., x + 1 / 0, x + 1 — y + l.D.x = y, (4) the rule of substitution of terms 
of equal type for free variables (bound variables do not occur in the sys¬ 
tem), (5) rules which permit the definition of a function by an equality with 
a term constructed from variables and previously defined functions or by 
primitive recursion with respect to a number variable, (6) the usual version 
of the inference of complete induction with respect to a number variable. 


7 Equality of functions is to be understood as intensional equality. It means that 
the two functions have the same procedure of computation, i.e., (by our definition of 
“computable function”) that they are identical. This is always decidable for two given 
functions, which justifies the application of the two-valued propositional calculus. 


(occurring in it both explicitly, and implicitly through implications of the form: If 
x, y, ... have certain types, then ...) by “reductively provable”. Note that, because 
“reductively provable” is a decidable property, the implications occurring may also be 
interpreted as truth-value functions. 

3. The fact that, if the axioms, rules, and primitive concepts of T (note, e.g., that 
each type is a primitive concept of T) are, by means of the definitions 1, 2, just given, 
replaced by really primitive concepts and insights (or, at least, by something closer to 
real primitives), thus obtaining a system T', only the (in comparison to Heyting’s) in¬ 
comparably narrower concept of reductive proof need be used in the propositions and 
proofs of T', and that, moreover, because these proofs are uniquely determined by the 
theorems, quantifications over “any proof” can be avoided. Note that it is not claimed 
that the proofs in T' are reductive. This is true only in certain cases, in particular for 
the proofs of the axioms of T and of the individual cases of the rules of T (nontrivially 
for those of groups (4) and (5), trivially for the others). What is claimed is only that 
no other concept of proof than that of reductive proof occurs in the propositions and 
proofs of T', except, of course, insofar as any theorem P in intuitionism means: A proof 
of P has been given. Substantially the same method for avoiding the use of Heyting’s 
logic or of the general concept of proof should be applicable also if T is interpreted in 
terms of Turing functions (see note (g)). 

Item 3 shows that the interpretation of intuitionistic logic, in terms of computable 
functions, in no way presupposes Heyting’s and that, moreover, it is constructive and 
evident in a higher degree than Heyting’s. For it is exactly the elimination of such vast 
generalities as “any proof’ which makes for greater evidence and constructivity. 

The higher degree of constructivity also appears in other facts, e.g., that Markov’s 
principle -*(x)<f>(x') D (3x)-if>(x) (see Kleene 1960, page 157, footnote) is trivially prov¬ 
able for any primitive recursive </> and, in a more general setting, for any decidable 
property <j> of any objects x. This, incidentally, gives an interest to this interpretation of 
intuitionistic logic (no matter whether in terms of computable functions of higher types 
or of Turing functions) even if Heyting’s logic is presupposed. 
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Note that the axioms and rules of this system, which will be called T, 1 are 
formally almost the same as those of primitive recursive number theory, 
only that the constants and variables (except those to which induction is 
applied) can have any finite type over the natural numbers. 8 The system 
T has the same deductive power as a system of recursive number theory in 


8 The only other difference consists in the fact that a function P of higher type can also 
be defined by a term equation of the form: [P(xi,X2, ■ ■ ■ , x n )] (yi, y 2 , ■ • • , y m ) = t, where 
t is a term containing no variables except x \,..., x n , yi, ■ ■ ■, ym■ This is a combination 
of two “abstractions”, i.e., applications of the A-operator. Formally, this difference 
vanishes if functions of several arguments are replaced by functions of one argument by 
A. Church’s method. 


‘For a precise description of T the following should be added: 

The primitive symbols of T are: 0, +1, =, variables and defined, constants of any 
finite type, “ application ” of functions to arguments of suitable types (denoted by .(.,.)), 
and propositional connectives. Terms are built solely out of constants, variables, and 
application. Meaningful formulae are truth-value functions of equations between terms 
of equal type. 

Regarding the axioms of T note the following: 

1. The version of complete induction used in the consistency proof is this: 

A(0, x). A(s, F{s, x)) D A(s + 1, s) I- A(a,*) 
where x is a finite sequence of variables of arbitrary types and F a sequence of previously 
defined functions of suitable types (as to the notation used here, see p. 278 below). 

2. For the proofs that the Axioms 1 and 4 of H and the deduction Rule 6 of H hold 
in the interpretation ' defined below, the following principle of disjunctive definition is 
needed: 

A function / may be defined by stipulating 

A D f(x) = ti, -AD/(x) = f 2 , 

where ti,t 2 are terms and A is a truth-value function of equations between number 
terms, both containing only previously defined functions and no variables except those 
of the sequence x. 

3. Both the version of complete induction mentioned under 1 and the disjunctive 
definitions mentioned under 2 can be derived in T, the latter by means of disjunctive 
functions H defined recursively thus: 

ff(0,/,</) = /, H{n + l,f,g) = g. 

However, it seems preferable first to formulate axioms from which the consistency 
proof is immediate, and then reduce them to simpler ones: What adds considerably to 
the simplicity of the consistency proof also is the fact that we avoid extensional equality, 
which is an incomparably more intricate concept than logical identity. 

4. If no attention is paid to the complexity of the consistency proof, the whole calcu¬ 
lus of propositions in T can be dispensed with. For, 1. as applied to number equations 
it can be replaced by certain purely arithmetical devices, 2. as applied to equations of 
higher type it can be altogether omitted if (a) the second equality axiom (group (2)) 
is formulated as a rule of inference (which, incidentally, is used only for substituting 
the definiens and definiendum for each other) and (b) a disjunctive rule of inference is 
introduced which says that, if A follows both from t = 0 and N(t) = 0 (where N is 
defined by: N( 0) = 1, N(x + 1) = 0) by means of the other axioms and rules, except 
the rule of substitution for variables of t, then A may be asserted. 

5. It is a curious fact that axiom group (3) is superfluous due to the recursive defin¬ 
ability of a function 8 by: 5(0) = 0, 8(x + 1) = x, and due to the definability of -73 by 
p O . 1 = 0. For it follows immediately that: x+1 = y+ 1. Z) ■ 8{x+l) = 5(i/+l) ,Z>.x = y 
and x + 1 = 0. D . S(x + 1) = 5(0) .D.x = 0.3.1 = 0;on the other hand -i(l = 0) by 
definition of ->. 
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which complete induction is permitted for all ordinal numbers less than e 0 - 

The reduction of the consistency of classical number theory to that of 
the system T is achieved by means of the following interpretation of intu- 
itionistic number theory, to which classical number theory is reducible: 9 

With every formula F of the system H of intuitionistic number theory 10 
a formula F' of the form (3y)(z)A(y, z, x) is associated, where x is the 
sequence of free variables of F, y and z are finite sequences of variables of 
finite types, and A(y, z, x) is a formula of T containing exactly the vari¬ 
ables occurring in x,y , z. The variables of any one of the three sequences, 
x,y, z (each of which may also be empty), are always mutually distinct 
and distinct from those of the other two sequences. We denote by xy the 
sequence compounded of x and y in this order. k 

Furthermore, the following notation is used in the formulas 1 to 6 below. 

1. v, w are finite sequences of variables which may be of any types; s, t, 
are number variables; u is a finite sequence of number variables. 

2. V is a finite sequence of variables whose number and types are deter¬ 
mined by the fact that each of them can be applied to y as an argument 
sequence, and that the sequence of terms thus obtained (which is denoted 
by V{y)) agrees with the sequence v as regards the number and the types 
of its members. For the empty sequence A we stipulate x(A) = x and 
A(x) = A, so that y(v) is well-determined also in case y or v is empty. 

3. The sequence of variables Y (or Z, or Z, respectively) is determined 
in the same manner, as regards the number and the types of its members, 


9 See Godel 1933e. 

10 H is supposed to be a system containing no propositional or function variables, but 
only number variables. The axioms and deduction rules of logic (given on p. 280 below) 
are to be considered as schemata for all possible substitutions of formulas of the system 
in place of the propositional variables.- 1 


•>For a complete description of the system H used in this paper the following should 
be added: Number-theoretic functions are defined only by primitive recursion and by 
setting the values of a function equal to those of a term composed of variables and 
previously introduced functions. Formulas are what is obtained from equations between 
terms by (iterated) application of propositional connectives and quantifiers. “Sequenzen" 
in Gentzen’s sense, or the descriptive operator t T , are not used. Complete induction is 
formulated as a rule of inference. The axioms of equality are: x = x and x = y. D 
.t(x) = t(y) for any term t(x). Outside of the axioms mentioned in footnote 10 and 
in this footnote, only the third and fourth Peano axioms are assumed. Evidently the 
systems T and H overlap. 

k From here on the reader is asked to pay attention to the fact that the letters and for¬ 
mulas occurring in the subsequent discussions are not, but rather denote, combinations 
of symbols of T or H; or like “(3a:)” or “x(y) n , they denote operations to be performed 
on combinations of symbols yielding other such combinations. This relation of deno¬ 
tation can easily be made perfectly precise. In particular note that in expressions like 
7)” the brackets denote an operation of substitution-, i.e., “A(a, /J, 7)” is to be 
regarded as an abbreviation of “Subst (.A*’^’*)”. Hence A(y,z,x ) = A. 
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by the sequence s (or yw, or y, respectively) and by the sequence y (or z, 
or z, respectively). 

One-member sequences of variables are identified with variables. If x is 
the empty sequence, (3x)A — {x)A = A by definition. 

The correspondence of F' to F is defined by induction on the number 
k of logical operators contained in F. The precautions to be taken in the 
choice of the symbols for the bound variables and the heuristic grounds for 
point 5 of the definition are given below the following formulas. 

I. Let F' = F for k = 0. 

II. Suppose 

F' = (3y)(z)A(y,z,x) 

and 

G' = (3 v)(w)B(v,w,u) 

have already been defined; then, per definitionem, we set: 

1. (F A G)'= (3yv){zw)[A(y, z, x) A B(v , w, a)]. 

2. ( F V GY = (3 yvt)(zw)\t = 0 A A(y, z, x) . V . t = 1 A B(v, w, it)]. 

3. [(^ = (3y)( M )i(y( S ),t,r). 

4. [(3s)F]' = (3sy){z)A(y,z,x), 

where s is a number variable contained in x. 

5. (F D GY = (3 VZ)(yw)[A(y,Z(yw),x) D B(V(y),w,u)\, 
which, by the definition of negation given on page 280 below, 
implies: 

6. W = (3 Z)(yhA(y,Z(y),x). 

Before using Rules 1-6 the bound variables in the formulas F' and G' 
are, if necessary, to be renamed so that they are all mutually distinct and 
different from the variables in the sequences x, u. Furthermore, the bound 
variables of the sequences t, Y, V, Z, Z, which are newly introduced by the 
application of the Rules 2, 3, 5, 6, axe to be so chosen that they are mutually 
distinct and different from the variables already occurring in the formulas 
concerned. 

The right-hand side of 5 is obtained by stepwise transforming the formula 
F' D G' according to the rule that propositions of the form (3x)(...) D 
(3 y)(. ..) (or (y)(.. .) D (x)(. . .), respectively), where x, y may be sequences 
of variables of any types, are replaced by propositions stating that there 
exist computable functions which assign to each example for the implicans 
(or counterexample for the implicatum, respectively) an example for the 
implicatum (or a counterexample for the implicans, respectively), taking 
account of the fact that ~>B D ->A . = . A D B. 1 


'The complexity of the definition of (F V G)' is necessary in order to ensure the 
decidability of V and, thereby, the validity of the inference pDr.qDrhpVq. Dr. 
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Of course it is not claimed that the Definitions 1-6 express the meaning 
of the logical particles introduced by Brouwer and Heyting. The question to 
what extent they can replace them requires closer investigation. It is easily 
shown that, if F is provable in H, then the proposition F' is constructively 
provable in T; to be more precise, if F' = (3y)(z)A(y,z,x), then a finite 
(possibly empty) sequence Q of functional constants can be defined, in T 
such that A(Q(x),z,x) is provable in T. The proof consists in verifying 
that the assertion holds for the axioms of H, and that, if it holds for the 
premises of a given rule of inference of H, it also holds for the conclusion. 

The verification becomes quite simple and straightforward if the follow¬ 
ing axiom system of intuitionistic logic is used: 

Axioms: (1) p D.pAp, (2) pAq.D p, (3) pAq.D.qAp, (4), (5), (6), the 
axioms for V which are dual to these, (7) 0 = 1 .D p. m Negation is defined 
by ~'P ■ — : p D ■ 0 = 1. 

Deduction rules: (1) Modus ponens, (2) substitution of terms for free 
variables, (3) p D q, q D r h p D r, (4) p A q . D r h p D. q D r and vice 
versa, (5) p D q b p D (x)q and vice versa, provided p does not contain x 
as a free variable, (6) p D r, q D r b p V q. D r, (7) p D q b (3 x)p. D q and 
vice versa, provided q does not contain a; as a free variable. 

It turns out that in the deduction rules 4, 5, and 7 the proposition to be 
proved in T is substantially the same as that already proved on the basis 
of the premise." For the consistency proof of classical number theory the 
axioms and deduction rules containing V or 3 can be omitted. 

It is clear that, starting with the same basic ideas, much stronger systems 
than T can be constructed, for example, by admitting transfinite types 
or the methods of deduction used by Brouwer for the proof of the “fan 
theorem”. 11 


11 See Heyting 1956, p. 42, and footnote d. 


For quantifier-free F and G, (F V G) 1 may be defined like (F A G)'. This has the 
consequence that F' = F for all quantifier-free F. 

m Axiom 7 may be omitted without jeopardizing the interpretability of classical in 
intuitionistic number theory (see Johansson 1936). 0 = 1. D ~<p follows from the 
definition of -> and the other axioms and rules. 

n Note moreover the following: The proof of the assertion is trivial for all axioms of 
H and for the deduction rule 2. For the deduction rules 1 and 3 it follows easily from 
the fact that what the formula A(Q(x),z,x) (corresponding to (p D q)') says is exactly 
that and how functions Q for q' can be derived from functions Q for p'. As far as 
complete induction is concerned, note that the conclusion of this inference specialized 
to the integer n is obtained from the premises by n-fold application of substitution and 
modus ponens. 

Note also that the deduction theorem can easily be proved for the interpretation 
Moreover, as Spector has observed (1962, p. 10), the system T enlarged by intuitionistic 
logic with quantifiers for functions of any finite type (i.e., his system £2 — {F}) can be 
interpreted in T in exactly the same way as intuitionistic number theory. The proof is 
carried out in detail in §9, pp. 12-15, of Spector’s paper. 






Introductory note to 1972a 


The item to which this note serves as introduction consists of three 
remarks (indicated in the following as Remarks 1, 2 and 3) on the un¬ 
decidability results, and was found appended to the galley proofs for 
1972. As explained in the introductory note to the latter, Godel ap¬ 
parently worked on 1972 off and on during the period 1965 1972. We 
do not know at what point he considered adding these three remarks, 
but internal evidence suggests that it was later in the period rather than 
earlier. The remarks deal with preoccupations that Godel had with both 
the generality and significance of his incompleteness results ever since 
the publication of his famous 1931 paper. 

In brief, Remark 1 is concerned with improvements in the statement 
of the second incompleteness theorem which increase its scope. This 
remark already appeared (in slightly variant form) as a footnote that 
Godel wrote in 1966 to accompany the translation of his 1932b in van 
Heijenoort 1967. Remark 2 promotes ideas concerning the need for ax¬ 
ioms of infinity in order to overcome incompleteness, ideas first suggested 
in footnote 48a of 1931 and expressed more fully in the 1964 supplement 
to 194 7. Finally, Remark 3 was presented as a footnote to Godel’s 1964 
postscript to his 1934 lectures (on the occasion of their reproduction in 
Davis 1965). Godel was there at pains both to emphasize the generality 
of his incompleteness results, in consequence of Turing’s analysis of the 
concept of “mechanical procedure”, and to reject the idea propounded 
by Turing and Post that these results establish “bounds for the powers 
of human reason”. Another version of this same remark was commu¬ 
nicated to Hao Wang and appeared on pages 325-326 of Wang 1974 
(completed, according to its introduction, in 1972.) 

One may speculate that Godel thought 1972 would be one of his last 
publications and that it provided a final opportunity to stress certain 
fundamental points and themes that he felt had been insufficiently ap¬ 
preciated. In addition, though he makes no explicit reference to 1972 
in these remarks, there is a more than casual connection. For, by the 
use in 1958 and 1972 of a new abstract concept (constructive function 
of finite type) to establish the consistency of elementary number theory, 
Godel illustrated his dictum in Remark 3 that incompleteness is to be 
overcome by the development of human understanding through the use 
of “more and more abstract terms”. 

Despite their brevity, these three remarks broach a wealth of mat¬ 
ters that must be addressed at length if they are to be dealt with at all 
adequately. The introductory notes which follow thus discuss each of 
them separately. The first has been written by me, the second jointly 
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with Robert M. Solovay, and the third by Judson C. Webb. In view of 
the length of these notes, the reader is advised to study Godel’s remarks 
first. 

Solomon Feferman 


Remark 1 

As explained above, this remark, entitled “The best and most general 
version of the unprovability of consistency in the same system” , a essen¬ 
tially reproduces a footnote which first appeared with the translation 
of 1932b in 1967. Godel excuses its reproduction anew, in 1972, by the 
statement that “perhaps it has not received sufficient notice”. 

The remark itself begins with the assertion that the consistency of a 
system S containing elementary number theory Z (directly or by trans¬ 
lation), may be provable in S', indeed, the consistency of very strong S 
may even be provable in a system of primitive recursive number the¬ 
ory. On the face of it, this assertion seems to contradict Godel’s own 
theorem on the unprovability of consistency in (primitive) recursive con¬ 
sistent extensions of arithmetic (1931, 1932b). However, by 1966 there 
were several examples in the literature of systems justifying Godel’s ap¬ 
parently contrary statement here. Those examples demonstrated that 
the applicability of Godel’s theorem on the underivability in a consistent 
system S of the consistency statement Cons depends essentially on how 
S is presented. That is, they showed that, for suitable S, another presen¬ 
tation S* of S could be given, with the same set of theorems, for which 
S* h Cons». In Takeuti 1955 this was done by changing the set of rules 
generating the theorems, in Feferman 1960 by changing the description 
of the set of axioms, and in Kreisel 1965 by changing the description 
of the set of proofs. b Of these, the example by Takeuti is perhaps the 
most natural, since it deals with systems that have established signif¬ 
icance in the literature, namely Gentzen-style sequential systems with or 


a This remark, dated 18 May 1966, was added as a footnote to 1932b in van 
Heijenoort 1967, p. 616. Godel erroneously refers to it as appearing in the translation 
of 1931 rather than 1932b. 

b This glosses over some essential points of difference as to just what is demon¬ 
strated by the examples of Takeuti and Feferman. In the case of the former, only a 
weak form of the consistency of S* is demonstrated in S *, namely a formula Con^, 
which expresses the non-provability of 0=1; the general form, which expresses that 
for each A not both A and -'A are provable, is not provable in the system S* used by 
Takeuti. The example provided by Feferman 1960 is not on its face effective, though 
S* happens to be presented by a formula which binumerates (numeralwise defines) 
in S the same set of axioms as S. 

Note also that the word ‘presentation’ in the text is not used in any specific 
technical sense. 




Note to 1972a 


283 


without the so-called cut rule (a form of modus ponens, or detachment, 
appropriate to Gentzen-style systems). The cut-elimination theorem for 
suitable such S shows that the cut rule is dispensable; the system S* 
is then S without the cut rule. So presented, S* can prove its own 
consistency. 0 The examples due to Feferman and Kreisel have only more 
limited technical significance.* 1 Godel was undoubtedly familiar with all 
these examples, but we do not know whether he had any of them specif¬ 
ically in mind when he referred here to the possibility of a system’s 
proving its own consistency. 

While Godel had sketched a proof of the underivability of consistency 
in 1931, a detailed proof was first given in Hilbert and Bernays 1939 
(pages 285-328) for a system Z of elementary number theory (and a 
related system Z^). The work there was broken into two parts. First, 
three “derivability conditions” Dl-3 were set down on the arithmetical 
formula Provg(x) expressing in S that x is the Godel number of a for¬ 
mula provable from S, and it was shown that, for the sentence Cong 
expressing the consistency of S and defined by 

Cons = Vx-i[Provs(x) A Provs(neg(x))], 

we have S I/ Cons whenever S is consistent and satisfies those conditions 
(ibid., pages 285-288). Second, the derivability conditions were verified 
for Z and Z^. e - 

The first derivability condition, Dl, states that if B follows from A 
in S then S I- Provs( r ^4 n ) —* Provs( r -B n ). f Condition D2 expresses a 
special case of closure of the provable formulas under a rule of numerical 
substitution. Finally, the third condition, D3, expresses a form of the 
adequacy of S for primitive recursive arithmetic, namely that for each 
primitive recursive function / (and corresponding function symbol in 5) 
we can prove in S the formalization of 

if f(m) = 0 then S I- f(m) — 0, 

where m is the numeral for m. Here S must contain primitive recursive 
arithmetic, either directly or by translation. The derivation of Godel’s 
second incompleteness theorem for consistent S also requires that the 


c In the weak sense explained in footnote b. 

d However, Jeroslow (1975) has shown that Feferman’s result applies in a natural 
way to a class of non-effective systems called “experimental logics”. 

e Hilbert and Bernays 1939, pp. 289-328. According to G. Kreisel, this procedure 
followed a plan Godel outlined to Bernays on a transatlantic voyage to the U.S.A. in 
1935. 

f We use r A n for the numeral in S corresponding to the Godel number of A. 



284 


Note to 1972a 


relation Proofs (x,y), which holds when y is the number of a proof 
in S of the formula with number x, be primitive recursive, and that 
Provs(ar) = 3y Proofs (x,y)- 

By the work of Hilbert and Bernays, at least one of the derivability 
conditions Dl-3 must fail for each of the three examples above of (pre¬ 
sentations of) systems which prove their own consistency. The obvious 
candidate in the case of Takeuti’s cut-free system is Dl. However, it was 
later shown by Jeroslow (1973) that the derivability condition Dl is ac¬ 
tually dispensable, in other words that D2 and D3 suffice for Godel’s 
second incompleteness theorem. Moreover, of these it is D3 which is 
crucial, since D2 can generally be trivially verified; indeed, all three ex¬ 
amples above fail to satisfy D3. It happens that the examples due to 
Kreisel and Takeuti both also fail to satisfy Dl, while that of Feferman 
does satisfy Dl. 

The general result stated informally by Gddel in Remark 1 is that a 
certain instance of what is now called the U\-reflection principle for S 
(denoted nJ-RPs in the following) is underivable in S, provided only 
that (i) S contains primitive recursive arithmetic (PRA) and is provably 
closed under the rules of the equational calculus and (ii) that lI ( j’-RP s 
is correct for S. Here n) 1 statements are those of the form Vxf(x) = 0, 
where / is primitive recursive, and IIj-RPs is the scheme 

(n?-RP s ) Prov s ( r vr) -+ A, for A in ll , 1 l . 

This scheme is correct for S if, whenever S P \/xf(x) — 0 with / prim¬ 
itive recursive, then for each natural number m, f(rn) = 0. Actually, 
Godel takes a slightly variant form n^-RP^ of this principle, one which 
expresses that every equation proved in S using only the rules of the 
equational calculus is correct for each numerical instance. It is this form 
that Godel calls the “outer consistency” of SA Thus Godel’s result can 
be restated as: 

(*) If S contains PR A and is outer consistent, then an instance of the 
outer consistency of S is not provable in S. 

Godel does not indicate a proof of (*) here, but such a proof can 
be reconstructed following standard lines for his second incompleteness 
theorem, with a small but essential technical change at one point. First, 
assume again that S has a primitive recursive presentation. Then there 
is a primitive recursive relation Proofs(n,m) which holds just in case 
m is the number of a proof in S of the formula with number n. If we let p 


6 This has not become established terminology. 
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be the characteristic function of this relation, then from the assumption 
that PR A is contained in S we have 

(1) Proof(n,m) implies S b p(n, m ) = 1, and 
->Proof(n, to) implies S b p(n, to) = 0. 

Let Provs(a;) be the formula 3 yp(x,y) = 1. Then 

(2) S b A implies S b Provs( r A’ 1 ). 

In the usual line of argument for the second incompleteness theorem, 
a sentence Go is formed (by diagonalization) in such a way that 

(3) S b [Go <- -iProvsCGo" 1 )]. 

Then one shows (first part of the first incompleteness theorem) that if 
S is consistent, then S I/ Go- By formalizing this argument one obtains 
S b (Cong —► Go), whence S I/ Cons. In the first part, one proceeds 
by assuming S b Go and applies (2) to conclude S b [Prov s ( r G 0 ~ 1 )]; 
then from S b [Go —> -'Provs( r Go" 1 )] and S b Go we conclude that S is 
inconsistent. In formalizing this, one must apply first D3 and then Dl. 

For the new argument here the main technical point is that, with¬ 
out change in the basic diagonal technique used to obtain (3), one can 
construct a sentence G and a primitive recursive term t such that 

(4) (i) G is Vy[p(t, y) = 0], and 
(ii) S b t = r G~ l . 

(In Kreisel and Takeuti 1974 such statements are called literal Godel 
sentences. The first published use of statements of this kind appears 
to have been in Jeroslow 1973.) Now, by analogy with the standard 
argument, one first shows the following: 

(5) If S is outer consistent, then S \f G. 

The proof of (5) proceeds simply, as follows: If S b G and n is the 
number of G, then for some to, p(n, m) = 1. Also, S b Vr/[p(f, y) = 0], 
so S b p{t, to) = 0 and by (4)(ii), S b p(n, to) = 0. Hence if S is outer 
consistent we have p(n,m) = 0, in contradiction to p(n,m) = 1. Now, 
by formalizing this argument for (5), one obtains: 

(6) For a certain instance A of n^-RP^, S b A —» G. 

Hence, under the hypotheses of (*), S \f A. The use of literal Godel 
sentences thus permits one to deal entirely with assumptions about 
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derivability via the equational calculus in S, in place of conditions D1 
and D3. 

For the usual systems, as Godel points out, outer consistency is equiv¬ 
alent to consistency; the argument, which is quite simple, goes back to 
Hilbert. h Moreover, the formal equivalence of Cons with n°-RP' s (and 
at the same time with n°-RPs) is provable in a system satisfying the 
Hilbert Bernays (or similar) derivability conditions. 1 

In fact, as Godel stresses, the question of establishing outer consis¬ 
tency (by finitary means) is the central one for Hilbert’s program (as 
formulated, for example, in Hilbert 1926). Hilbert had divided the 
statements of a language into ideal ones and real ones. By the lat¬ 
ter he meant the purely universal statements, each numerical instance 
of which was subject to a finitary check. Hilbert’s program aimed to 
show that for various systems S encompassing mathematical practice 
the “ideal” statements can be eliminated from derivations of the “real” 
statements, in other words, that the reflection principle holds for the lat¬ 
ter class of statements. Thus the program requires the outer consistency 
of S, for which, as Hilbert observed, it would be sufficient to establish 
the ordinary consistency of S, at least for the usual systems S. But 
Godel’s second incompleteness theorem showed that for these systems 
one cannot hope to prove the consistency of S within S. What Godel 
accomplishes in the present remark is to show, even more generally, that 
one cannot hope to prove the outer consistency of S within S, if indeed 
outer consistency holds for S. Thus, with respect to Hilbert’s program, 
Godel can fairly claim to have established “the best and most general 
version” of his second incompleteness theorem. 

However, Godel ignores generalizations of his incompleteness theo¬ 
rems to other situations, for example to various non-constructive systems 
in Rosser 193 7, Mostowski 1952 , and elsewhere. Nor does he concern 
himself with generalizations of the reflection principles, such as were 
dealt with in Feferman 1962 and in Kreisel and Levy 1968 and which 
have a variety of important applications outside of Hilbert’s program. 
Moreover, systems encompassing ordinary mathematical practice must 
include modus ponens (or the cut rule), so in this respect the kind of 
generalization obtained by Godel is of marginal interest. This is not 
to deny that cut-free systems have been of fundamental importance in 
proof theory (see, for example, Takeuti 1975) or that they provide a use¬ 
ful context in which to illustrate various technical aspects of the use of 
self-referential statements, as shown for example in Kreisel and Takeuti 


h See, for example, page 474 of the 
1967. 


'For details, see 


translation of Hilbert 1928 in 


Heijenoort 


Kreisel and Levy 1968, p. 105, or Smorynski 1977, p. 846. 
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1974■ But for the usual systems and the various non-constructive ex¬ 
tensions that have been considered, it is both much more natural and of 
greater generality to follow the lead of Lob 1955, in which quite elegant 
abstract derivability conditions (modifying those of Hilbert and Bernays) 
proved to be the appropriate means for settling the status of various self- 
referential statements and reflection principles in such systems. Lob’s 
results have been put in an even more general logical context through 
the work of Solovay (1976) on the completeness of certain modal logics 
under the provability interpretation of the necessity operator.^ Still, to 
study the question of applicability of Lob’s derivability conditions, one 
must consider how formal systems may be presented within themselves. 
Here, as Kreisel has often stressed (see for example his 1965, page 154), 
dealing with the question of what constitutes a canonical presentation 
of a formal system becomes the central concern. One solution has been 
provided in Feferman 1982. 

One final technical point concerns incompleteness theorems for sys¬ 
tems (much) weaker than arithmetic, for example those such as PRA 
which are quantifier-free. Godel points out that his “most general” ver¬ 
sion of the second incompleteness theorem can be extended to apply to 
such systems. For the technical tools needed to deal with related ver¬ 
sions of the theorem, see Jeroslow 1973. 

Solomon Feferman 


Remark 2 

This remark begins with what Godel terms “another version of the 
first undecidability theorem”, which concerns the degree of complexity 
(or “complication”, in Godel’s words) of axioms needed to settle prob¬ 
lems of “Goldbach type” of high complexity. Godel had also referred 
to problems of this type in 1964, and he explained there (in footnote 
42) that by such he meant “universal propositions about integers which 
can be decided in each individual instance” . k Most generally, then, such 
propositions are statements of the form VxR(x) with R general recur¬ 
sive (or effectively decidable, by Church’s thesis). It is shown in re¬ 
cursion theory that every such statement is equivalent to one of the 
same form with R primitive recursive, and by definition these comprise 
the class of n® statements. In fact, it is known through the work of 


JSee also Boolos 1979. 

k See p. 269 above. Goldbach’s own statement, dating from his 1742 letter to 
Euler, is the still unsettled conjecture that every even integer is the sum of two 
primes. (For Goldbach and Euler, 1 was a prime.) 
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Matiyasevich that every IlJ statement is equivalent to one of the form 
Vaq ... \/x n \p(x i,..., x n ) yt q(x i,..., a:„)], where p and q are polynomi¬ 
als with integer coefficients and n < 13. 1 

Godel here takes the degree of complexity d(A) of a formula A (in a 
given language) to be the number of basic symbols occurring in it. In 
other words, if, for a given basic stock of symbols Si,..., s m , the for¬ 
mula is written as a concatenation A = sq ... Si k , then d(A) is defined 
to be k. For S a finite set of (distinct) formulas A] ,..., A n , considered as 
a system of non-logical axioms, the degree d(S) is defined to be 

d(Ai) H-1- d(A n ) + (n - 1). The theorem stated informally by Godel 

is that in order to solve all problems A of Goldbach type of a “certain” 
degree k, one needs a system of axioms S with degree d(S) > k, “up to 
a minor correction”. It is not clear what kind of minor correction Godel 
intended here, so we do not know just how he would have stated this 
as a precise result. After examining this question more closely, the au¬ 
thors have arrived at some results of the same character as Godel’s, but 
not quite as strong as what would be suggested by a first reading of his 
assertion; we have not, however, been able to establish the latter itself. 
These various statements and their status are explained as follows. 

Let £ be a language with a finite stock si,..., s m of basic symbols, 
including logical symbols such as ‘-V, ‘A’, ‘V’, a constant symbol ‘O', the 
successor symbol"’, a means for systematically forming variable symbols 
l Vi for i = 0,1,2,... from the basic symbols,” 1 the equality symbol *=’, 
and parentheses ‘(’, *)’. C should also contain symbols, either directly 
or by definition, for a certain number of primitive recursive functions 
/o,..., fj, where /o and /i are + and •, respectively. It is assumed 
that we have a consistent finite axiom system So in C which contains 
(or proves) defining equations for /o,..., fj, and enough of the axiom 
system of primitive recursive arithmetic for these functions in order to 
carry out Godel’s first incompleteness theorem. In particular, So should 
be consistent and complete for Sj sentences (and hence correct for 11° 
sentences). For the assertion of Godel’s being examined here, only those 
systems S are considered which are consistent and contain So- Then the 
following theorem can be proved: 


'See Davis, Matiyasevich and Robinson 1976. Matiyasevich later showed that 
one could take n < 9; see his 1977. 

m One obvious way to do this is to identify Vi with v O "...' where ‘v’ is a new 

basic symbol; this makes = i + 2. However, there are somewhat more efficient 

ways of building Vj from basic symbols, so that d(vi) = log 2 i + 0(log 2 log 2 i): we 
shall assume that such an encoding is being used in the discussion that follows. 
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(*) There are positive integers ci,c 2 such that for all k > C2 and k\ = 
(k — c 2 )/ci, no finite consistent extension S of So with d(S) < fci 
proves all true IT® statements A having d(A) < k. 

The proof of (*) rests on an examination of Gddel’s construction 
in his 1931 (for the first incompleteness theorem) of a true II® state¬ 
ment Gs which is not provable from S for any finite consistent S ex¬ 
tending So- Gs can be regarded as a statement which expresses that 
Conj(S) —► Gs is not logically provable, where Conj(S) = Ai A • • • A A n 
for S — {Ai ,..., A n ). This construction is uniform in S; that is, for a 
suitable II® formula B(v 0 ) with at most Vo free, we have Gs equivalent 
to B( r Conj(S) n ), where r Conj(S)~ 1 is the numeral in £ for a Godel num¬ 
ber of Conj(S). Using this, it may be shown that Gs can be chosen with 
d(Gs) < cid(S) + c 2 , where c\ is a constant depending on the efficiency 
of the Godel numbering of expressions. It turns out that one can take 
ci = [log 2 m] + 1, where m is the number of basic symbols in £.“ For 
the usual logical systems m is between 8 and 16, hence ci = 4. But a 
first reading of Godel’s assertion under consideration would put ci = 1 
in (*); call that assertion (J). (If (J) holds, Godel’s “minor correction” 
would simply be c 2 .) 

The remainder of Gddel’s remark does not depend essentially on 
whether one can obtain (f) or not, but only that we at least have (*). 
For Godel’s way of measuring complexity, the crucial thing is that the 
degree of complexity of axiom systems needed to establish true II® sen¬ 
tences A increases roughly in direct proportion ci to the complexity of 
A, where ci is small. 

We now pass from these technical questions to Godel’s discussion of 
their significance. This shifts, in effect, to systems of set theory. The 
reason is that all of present-day mathematics can be formalized in a rel¬ 
atively simple finite system Si of set theory (for example, the Bernays- 
Godel system of sets and classes). According to Godel, it follows from 
the result (1), or (*), that in order to solve problems of Goldbach type 
which can be formulated in a few pages, the axioms of Si “will have to be 
supplemented by a great number of new ones or by axioms of great com¬ 
plication.” Naturally, one would be led to accept as axioms only those 
statements that are recognized to be evident, though not necessarily 
immediately so for the intended interpretation (that being, in the case 
of BG, sets in the cumulative hierarchy together with arbitrary classes 
of sets). Thus Godel says that one may be led to doubt “whether evident 


"Coders own numbering of expressions in 1931 is rather inefficient and gives a 
comparatively large value for ci- The proof that ci = [log 2 m] + 1 suffices relies 
particularly on the more efficient coding of variables mentioned in footnote m. 
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axioms in such great numbers (or of such great complexity) can exist at 
all, and therefore the theorem mentioned might be taken as an indica¬ 
tion for the existence of mathematical yes or no questions undecidable 
for the human mind.” 

In response to such doubts, Godel points out “the fact that there do 
exist unexplored series of axioms which are analytic in the sense that 
they only explicate the content of the concepts occuring in them”. As his 
main example, he cites the axioms of infinity in set theory, “which assert 
the existence of sets of greater and greater cardinality or of higher and 
higher transfinite types” and “which only explicate the content of the 
general concept of set.” Here Godel repeats ideas broached in 1947 and 
more fully in its revised version 1964-° There he said that the axioms for 
set theory “can be supplemented without arbitrariness by new axioms 
which only unfold the concept of set ...” (1964, page 264). Moreover, 
the axioms of set theory are recognized to be correct by a faculty of 
mathematical intuition, which Godel says is analogous to that of sense 
perception of physical objects: “... we do have something like a percep¬ 
tion also of the objects of set theory, as is seen from the fact that the 
axioms force themselves upon us as being true” (1964, page 271). He 
goes on to note there that “mathematical intuition need not be conceived 
of as a faculty giving an immediate knowledge of the objects concerned.” 
In 1964 that point is elaborated by reference to Kantian philosophy. But 
at the end of the present remark, Godel puts the matter in a way that 
is supported by the working experience of set theorists who have been 
led to accept axioms of infinity, namely: “These principles show that 
ever more (and ever more complicated) axioms appear during the de¬ 
velopment of mathematics. For, in order only to understand the axioms 
of infinity, one must first have developed set theory to a considerable 
extent.” The implicit but unstated conclusion of all this is that such 
axioms of increasing complexity can be used to settle more and more 
complicated problems of “Goldbach type”. In other words, despite re¬ 
sults such as (*) (or even (j:), if true) “mathematical yes or no questions 
undecidable for the human mind” need not exist, in principle. 

There is one essential difference of aim in the discussions of 1964 and 
of the present remark, concerning the possible utility of axioms of infin¬ 
ity. In 1964, Godel thought that such axioms could be used to decide 
CH, whereas here he aims to use them to solve number-theoretic prob¬ 
lems. The study of the so-called axioms of infinity goes back to Hausdorff 
(1908), followed by several publications by Mahlo (1911, 1912, 1913). 
After that, there was only scattered work in the subject until the late 


°See particularly 1964 ? PP- 264-265 and 271—272. Godel first touched on axioms 
of infinity in footnote 48a of his 1931 and in 1932b. 
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1950s, when it began to undergo intensive development that continues 
to this day. Contrary to Godel’s views, there is no universal agreement 
among those who have studied set theory deeply as to the acceptability 
of these statements as axioms. For very favorable views, see Reinhardt 
1974 or Kanamori and Magidor 1978 (the latter also being a very use¬ 
ful survey paper); for a completely opposite (negative) view, see Cohen 
1971. While it is certainly true that one must do considerable work in 
the subject in order to understand these statements and thus (perhaps) 
be led to accept them, it is not the case that the complexity (in Godel’s 
sense) of the additional axioms has grown enormously, since new and 
stronger axioms simply displace old ones. Thus a few pages suffice to 
formulate the strongest such statements that have been considered, and 
the complexity of the additional axioms is still relatively low. It is true 
that, for each new axiom of infinity A which has been considered and 
which goes beyond a previously accepted S, A “solves” the above Gs, 
simply because S + A establishes the existence of a model for S and 
thus proves the consistency of S (a statement equivalent to Gs). On the 
other hand, Godel’s hope in 1964 that use of axioms of infinity might 
settle CH has simply not been realized. As Martin explains in his report 
on Cantor’s continuum problem, CH has been shown to be undecidable 
relative to any remotely plausible extension of the usual axioms ZFC of 
set theory by axioms of infinity ( Martin 1976, page 86). p The situation 
has not changed at the time of this writing. 

Axioms of infinity are offered by Godel as an example of further ax¬ 
ioms that might help solve previously unsettled problems. In 1964 , page 
265, he suggested that there might be “other (hitherto unknown) ax¬ 
ioms of set theory which a more profound understanding of the con¬ 
cepts underlying logic and mathematics would enable us to recognize 
as implied by these concepts”. But his indication (1964, footnote 23) 
of the nature of such (as stating some kind of “maximum property”) 
is rather vague. Since then a number of other specific axioms have 
in fact been proposed, some of which have been studied intensively by 
set theorists—in particular, the so-called “axiom of determinacy” (see, 
for example, Martin 1976). With respect to the present discussion, 
all these share with the axioms of infinity the following characteristics: 
(i) though they have received some degree of acceptance among set theo¬ 
rists, none of these axioms is widely accepted by the general community 
of mathematicians, (ii) their complexity is relatively low, (iii) they serve 


p The “axiom of constructibility” V = L does prove GCH, as we know from Godel 
1938-194.0, but is incompatible with strong axioms of infinity. It is also seen to be 
intuitively false in the intended interpretation, since it says that all sets are definable 
in a certain way. See the introductory note to 1938 1940 in this volume. 
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to establish previously undecided propositions Gs, and (iv) they do not 
settle CH. 

While the two authors of this note disagree about the foundational 
status of all such proposed new axioms, they agree that—contrary to 
Godel’s view expressed here—there are simple “mathematical yes or no 
problems” which will probably never be settled by the human mind be¬ 
cause they are beyond all remotely feasible computational power and 
provide absolutely no conceptual foothold. For example, we can ask the 
following question: Is it true that if w is the sequence of the first 2 2 
terms in the binary expression of 7T-3, then the last term of w is 0? 

To conclude, it is our view that whatever technical interest there may 
be in such measures of complexity as those offered here by Godel (and 
related ones by Kolmogorov and Chaitin q ), they are irrelevant to the 
experience of working mathematicians. It is not the complexity of the 
axioms needed for solving problems which is at issue in practice, but 
rather the complexity of the proofs required, and here there is no simple 
relationship between results and proofs. Relatively complicated prob¬ 
lems can have relatively simple proofs once the right key is found, while 
relatively simple problems may require amassing an enormous amount 
of (conceptually) complicated machinery in order finally to settle them. 
Moreover, complexity is a shifting matter in the eyes of mathematicians: 
As mathematics develops, previously complicated notions and results 
are assimilated and become everyday tools for the attack on yet more 
difficult problems. A realistic mathematical theory of this common psy¬ 
chological experience has yet to be provided. 

Solomon Feferman and Robert M. Solovay 


Remark 3 

Godel’s 1964 Postscriptum to 1934 began by stressing that the “pre¬ 
cise and unquestionably adequate definition of the general concept of for¬ 
mal system” made possible by Turing’s work allows his incompleteness 
theorems to be “proved rigorously for every consistent formal system 
containing a certain amount of finitary number theory” (Godel in Davis 
1965, page 71 = Godel 1986, page 369). He insisted, however, that such 
generalized undecidability results “do not establish any bounds for the 
powers of human reason, but rather for the potentialities of pure formal¬ 
ism in mathematics” (1986, page 370). Godel was no doubt responding 


q See Chai.tin 1974■ 



Note to 1972a 


293 


here to the claim in Post 1936 that the generality of the incompleteness 
and undecidability theorems for “all symbolic logics and all methods of 
solvability” required that Church’s thesis be seen as “a natural law”, for 
“to mask this identification under a definition hides the fact that a fun¬ 
damental discovery in the limitations of the mathematicizing power of 
Homo sapiens has been made and blinds us to the need of its continual 
verification” (Post in Davis 1965, page 291). But on reflection G5del 
realized that, insofar as they tried to show that “mental procedures can¬ 
not go beyond mechanical procedures” effectively, Turing’s arguments 
for his “unquestionably adequate definition” of computability would im¬ 
ply the same kind of limitation on human reason as claimed by Post, 
and so he wrote this Remark 3 of 1972a on Turing’s “philosophical er¬ 
ror” as a footnote elaborating his disclaimer quoted above. Our problem 
is to understand how Godel could enjoy the generality conferred on his 
results by Turing’s work, despite the error of its ways. Since this clearly 
involves not only his interpretation of Turing’s work but also of Church’s 
thesis generally, a brief review of Godel’s role in the emergence of this 
thesis may help to put some aspects of our problem in perspective. 

In 1934 Godel claimed that the primitive recursive functions used in 
his arithmetization of syntax can all be “computed by a finite proce¬ 
dure”. In a footnote he said that the converse “seems to be true” if 
we allow “recursions of other forms”. Herbrand {1931) admitted arbi¬ 
trary recursion equations (axioms of Group C) into his formalism for 
arithmetic, provided that, “considered intuitionistically, they make the 
actual computation of the fc(x 1,.. .,x n ) possible for every given set of 
numbers, and it is possible to prove intuitionistically that we obtain a 
well-determined result” (Herbrand in van Heijenoort 1967, page 624). 
By this he meant that the computation be carried out informally in 
“ordinary language” and shown constructively to terminate. He also 
claimed that it was impossible to “describe outright” all these /j, since 
otherwise / x (a;)-f 1 would be an “intuitionistically defined function” not 
in the list generated by such a description. He concluded that Godel’s in¬ 
completeness theorems did not hold for his arithmetic, for “to carry out 
Godel’s argument, we have to number all objects occurring in proofs” 
{ibid., page 627); but to number the objects of his formalism one would 
have to focus on “a definite group of schemata” for his recursions, and 
since the diagonal function f x {x ) +1 “cannot be among these functions” 
{ibid.), it could not have any Godel number. This raised the question 
of the generality of his recursions as well as that of Godel’s theorem 
itself. In a letter to Godel he formulated a general but precise notion of 
recursion equations and called “recursive” those functions which are the 
unique solutions of such equations. 

Godel then realized, after rejecting as “thoroughly unsatisfactory” 
Church’s proposal to identify effectiveness with A-definability, that 
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Herbrand’s definition could be modified in the direction of effectiveness, 
and he proposed to call “general recursive” those functions whose values 
could be deduced from his equations by two explicitly stated rules. This 
made his aforementioned footnote suggesting that ‘finite computabil¬ 
ity’ could be identified with a wide enough class of recursions sound 
like Church’s thesis after all. But Godel had qualified this as only a 
“heuristic principle”, and later wrote to Davis that he was not propos¬ 
ing Church’s thesis, but rather 

(GT) The functions “computed by a finite procedure” are those 
definable by “recursions of the most general kind”, 

explaining that in 1934 he was “not at all convinced that my concept 
of recursion comprises all possible recursions” (Godel in Davis 1982, 
page 8) and that the equivalence of his concept, based on Herbrand’s 
equations, with Kleene’s, based on minimalization, “is not quite triv¬ 
ial” (ibid.). Indeed, if the proof of the general recursiveness of all the 
)i-recursive functions by Kleene 1936 were trivial, Godel would presum¬ 
ably not have been in such doubt about the generality of his recursions. 
In fact, this result is behind the “kind of miracle” that Godel (1946) saw 
in the closure of recursiveness under diagonalization, which allowed it to 
provide an “absolute definition” for the “epistemological notion” of com¬ 
putability: given any recursive sequence of general recursive functions, 
Kleene defined by minimalization a new diagonal function rj> x (x) + 1, 
which is nevertheless still general recursive by Kleene’s result. On the 
other hand, Kleene’s normal form theorem shows by explicit arithmetiza- 
tion of the Herbrand-Godel formalism that all the general recursive func¬ 
tions will have Godel numbers, and hence refutes Herbrand’s claim that 
the diagonalization of any “definite group of schemata” for his recursions 
must yield a new recursive function with no Godel number. Adding to 
this Kleene’s general result on “the undecidability, in general, which sys¬ 
tems of equations define recursive functions” (Kleene 1936 = Davis 1965, 
page 248), we cannot but wonder why Kleene’s analysis did not elimi¬ 
nate all Godel’s qualms about replacing the right side of (GT) by general 
recursiveness/ Certainly it showed that one could not reasonably expect 


r Davis (1982) remarks that Kleene’s normal form theorem “must have gone a 
considerable distance towards convincing Godel that his ‘concept of recursion’ indeed 
‘comprises all possible recursions’ ” (p. 11). I have tried to spell out here why Kleene’s 
results should perhaps have done so, but in fact we have no direct evidence that they 
did. Kleene’s paper would even seem to be the plausible source for Godel’s remark in 
1946 about the “miracle” of diagonalization, but it may well be that he never really 
looked at it until after he read and was convinced by Turing’s very different analysis 
of computability. 
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to diagonalize out of this class by any “finite procedure”: For (i) clearly 
effective diagonalizations miraculously do not lead outside this class, and 
(ii) those which do are clearly not effective, since they would depend on 
knowing effectively which sets of Herbrand’s equations lead to his “well- 
determined result”, by GodePs result. s In fact, Kleene shows that in any 
of “certain formal logics” the arithmetization of infinitely many true 
claims of this form will be unprovable, bringing out the importance of 
formally undecidable sentences in justifying (ii). It would thus seem that 
Kleene’s analysis should not only have convinced Godel that his recur¬ 
sions were wide enough to comprise the “finite procedures”, but also pro¬ 
vided him with answers to Herbrand’s argument against the generality of 
his incompleteness theorems. In translating classical into intuitionistic 
arithmetic, Godel ( 1933e ) had used the system of Herbrand 1931 to rep¬ 
resent the former, where his formulation spoke of “the denumerable set 
of function signs f" introduced by Herbrand’s Group C of axioms. 1 But 
in 1934, where he formulated the “conditions that a formal system must 
satisfy” for his incompleteness theorems to apply by asking that it be so 
Godel numbered that its axioms and relation of immediate consequence 
be primitive recursive, he was conspicuously silent about Herbrand’s 
claim that his functions themselves could not be numbered. Why then, 
in view of his own explanation for not having advanced Church’s thesis 
in 1934, did Godel never cite Kleene 1936, either as having helped to 
persuade him of it or to settle his score with Herbrand? 

The answer to the latter question seems to be that Godel was never 
able to satisfy himself with any answer to Herbrand. Indeed, in his 1965 
elaboration of footnote 34 of 1934 he still claimed that the equivalence 
of Herbrand’s notion of a constructively provable recursive function with 
general recursiveness was “a largely epistemological question which has 
not yet been answered”. The question was whether or not the concept of 
a computable function depends on that of an intuitionistic or materially 
correct proof." Godel evidently thought that Turing’s analysis showed 


s Since Herbrand had already stressed in an unsigned note that “all the functions 
introduced must actually be calculable for all values of their arguments by means of 
operations described wholly beforehand” (1971, p. 273), it seems fair to say that the 
concept of general recursiveness was already his (cf. van Heijenoort 1982). The real 
importance of actually writing down such rules as Godel did was that it allowed the 
arithmetization of the entire theory of recursive functions: only then could Kleene’s 
plan to “treat the defining equations formally, as sequences of symbols” (1936, p. 
729 = Davis 1965, p. 239) lead to substantial results. 

* As is pointed out in the introductory note to 1933e, in volume I of these Works, 
Godel’s use of these axioms plays no special role in his proofs. 

u One should consult the introductory note to 1958 and 1972, where the consid¬ 
erable amount of trouble and hard thinking this issue caused Godel are analyzed at 
length. 
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the notion of “mechanical computability” to be independent of such a 
proof concept, and went on to stress in his Postscriptum to 1934, as we 
have seen, that Turing’s analysis also provided an unquestionably ade¬ 
quate definition of a formal system. For the “essence” of such a system, 
says Godel, is to completely replace reasoning by “mechanical operations 
on formulas”, and Turing had shown this to be “equivalent” to a Turing 
machine (1965, page 72). But Godel never uses the word “effective” 
to describe the explicanda of Turing’s analysis, and says explicitly that 
general recursiveness can take its place in (GT) only “if ‘finite proce¬ 
dure’ is understood to mean ‘mechanical procedure’” (ibid., page 73). 
So it is unclear whether any analysis ever convinced him that any of 
these equivalent mathematical concepts comprised the “effective” func¬ 
tions. He stressed that Turing’s analysis is independent of “the question 
of whether there exist finite non-mechanical procedures ... such as in¬ 
volve the use of abstract terms on the basis of their meaning” (ibid.), 
referring to his own 1958 where his abstract concept of a “computable 
function of type t" is used to prove the consistency of arithmetic. Yet he 
also emphasized, as we have seen, that the undecidability results made 
possible by Turing placed no limitation on human reason, but only on 
pure formalism—a point he tries to explain in his Remark 3 of 1972a 
on Turing’s “philosophical error” of assuming that a human computer 
would be capable of only finitely many distinguishable mental states. To 
understand how Godel could allow an analysis based on such an error 
to stand as “unquestionably adequate” to establish the generality of his 
theorems, we turn to Turing’s work. 

In a personal communication to the author, Feferman has plausi¬ 
bly conjectured that the basic new feature of Turing’s machines which 
convinced Godel of their adequacy for defining a general concept of 
“mechanical procedure” was the deterministic character of their com¬ 
putations, since this automatically ensured the consistency of Turing’s 
definition of computability. In Church’s A-calculus the calculation of a 
normal form for a term representing a function may take many different 
courses, and the Church-Rosser theorem on the uniqueness of exist¬ 
ing normal forms, which ensures the consistency of Church’s definition, 
was indeed, as Feferman (1984a) points out, “exceptionally difficult” 
to follow—to say nothing of the ontological obscurity surrounding the 
A-calculus. This may also explain why Kleene 1936 failed to completely 
persuade Godel. v While this feature alone of Turing’s analysis may 


v See Kleene 1943, where the analogous problems of consistency for various for¬ 
malisms for recursive functions not mentioned in 1936 are discussed fully. In particu¬ 
lar, Kleene points out that the consistency proof for one formalism of partial recursive 
functions “seems to require the type of argument used in the Church-Rosser consis¬ 
tency proof for A-conversion.” 
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explain why Godel found it more convincing than Church’s and Kleene’s, 
we have to look more closely at Turing’s arguments to see how Godel 
might have acquiesced in its advantages despite their error. 

In Turing 1937 (reproduced in Davis 1965, pages 116 149), Turing’s 
arguments for the thesis that his machines could compute any function 
“calculable by finite means” are divided into three types. Type I presents 
his analysis of the operations an ideal “human computer” could perform 
and depends on the assumption, questioned by Godel, that he is capable 
of only finitely many “states of mind”. Type II shows that the entire 
deductive apparatus of the predicate calculus can be simulated by one of 
his machines. Type III is a “modification” of the type I argument that 
replaces the problematic notion of a state of mind by “a more phys¬ 
ical and definite counterpart of it” (Turing in Davis 1965, page 139), 
namely, a “note of instructions” enabling his computer “to break off 
from his work” and later resume it. Since each such note “must enable 
him to carry out one step and write the next note” (ibid.), it follows 
that each stage of his computation is “completely determined” by such 
notes. Turing argues that, since the instantaneous “state of the system” 
comprised of a note of instructions and tape symbols can be represented 
by single expressions, the entire computation of his computer could be 
formalized in the predicate calculus, and therefore, by the type II ar¬ 
gument, carried out by one of his machines. Thus Feferman suggests 
that Godel rejected only Turing’s type I argument, while accepting his 
“more physical” type III argument. Indeed, Wang (1974, page 326) 
reports that in discussions with Godel about this remark, Godel admit¬ 
ted the validity of Turing’s argument under two additional assumptions: 
(1) “There is no mind separate from matter,” and (2) “The brain func¬ 
tions basically like a digital computer,” or (2') “The physical laws, in 
their observable consequences, have a finite limit of precision”. Although 
Godel accepted both (2) and (2'), he rejected (1) as “a prejudice of 
our time” which would eventually be scientifically refuted, possibly by 
showing that “there aren’t enough nerve cells to perform the observable 
operations of the mind” (ibid.). Godel believed that Turing’s argument 
depended on some form of physicalism, and indeed Turing says of the 
elementary operations of his human computer that “every such oper¬ 
ation consists of some change of the physical system consisting of the 
computer and his tape” (Turing in Davis 1965, page 136). But since 
there is no doubt that such “observable” mental operations as reading 
symbols can be performed by one’s nerve cells, it is not yet clear what 
is lost in Turing’s physicalist analysis of computability, or even that it 
really depends on assumption (1). Moreover, since the physicalist claim 
just quoted occurs in the type I argument, it is clear that in analyzing 
Godel’s critique of Turing we cannot simply ask which of the three types 
of argument he may have found valid or invalid. Indeed, a closer look at 
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Turing’s paper will show, I believe, that he basically presented just one 
argument, one which is still plausible without any physicalist premises, 
but also one that Godel could accept only under his own interpretation 
on the conclusion of Turing’s argument, that is, on Turing’s thesis. Let 
us examine this more closely. 

For Turing the question of what can be done “effectively” concerns 
memory, though it can only scan one symbol at a time, by altering its 
state “the machine can effectively remember some of the symbols which 
it has ‘seen’ (scanned) previously” (ibid., page 117). The restriction to 
a finite number of states thus limits its memory, and Turing justifies its 
comparison with a human computer by reference to “the fact that the 
human memory is necessarily limited” (ibid.). How else is one to repre¬ 
sent this limitation except in terms of the number of states? Since Godel 
accepted such a limitation on the brain, it seems that he may have en¬ 
visaged, as did Leibniz, some kind of purely mental memory, “separate” 
from that of the brain. But this would still not undermine Turing’s type 
I argument, for it does not deny the existence of an infinity of mental 
states, much less assume (1), but argues rather that “if we admitted 
an infinity of states of mind, some of them will be ‘arbitrarily close’ 
and will be confused” (ibid., page 136). Turing’s point is that only an 
effectively distinguishable set of states could be used to “effectively re¬ 
member” symbols, and hence to effectively compute. Godel admits that 
this set is finite even for the mind in its current stage of development, 
but envisaged the possibility of “systematic methods” for so actualizing 
the development of our understanding of abstract terms that it would 
“converge” to infinity. w 


w That Turing’s finiteness hypothesis is perfectly compatible with a “dynamic” 
view of mind, however, emerges clearly when we consider the suggestion that it was 
Turing’s type III argument that convinced Godel (that “mechanically computable” 
functions are Turing-computable). This argument actually contains a sketch for a 
rather different but still direct formulation of the computability idea, indeed the very 
one simultaneously and independently worked out in Post 1936. (See Hodges 1983 
for some interesting remarks on this.) Here the “memorial” role of states, that they 
depend on previous states and scanned symbols, is played by instructions so num¬ 
bered that they can refer to each other. The equivalence of Turing’s computability 
and Post’s is thus essentially contained in Turing’s type III argument itself: it can 
be regarded as essentially a sketch for a proof that his machines could simulate any 
of Post’s finite 1-processes. Turing’s restriction of his human computer to finitely 
many states is thus equivalent to Post’s restriction of his “worker”, who executes the 
same atomic operations, to finitely many instructions. Post (1941) saw his restric¬ 
tion as evident from the fact that “the system of symbolizations ... is essentially to 
be a human product and each symbolization a human way of describing the original 
mathematical state” (p. 427). He emphasized the dualism between “the static outer 
symbol-space” and our “dynamic mental world” which nevertheless has “its obvious 
limitations”, ones that are fully emphasized by Turing’s “finite number of mental 
states” hypothesis (p. 431). 
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But does it follow, as Godel claimed, that from the convergence to 
infinity of the number and precision of our abstract terms the number 
of distinguishable states of mind must so converge? Turing might say: If 
we admit an infinity of abstract terms some of them will be “confused” 
with each other. Godel would say: Not if each of them is understood 
“precisely”. This assumption, that coming to understand such a term 
precisely always creates a distinct new mental state for using it, not 
only depends on the special significance Godel attaches to “abstract” 
terms, but also shows that he has in mind a notion of “state” different 
from Turing’s. This seems clear from the superficial similarity of Godel’s 
argument to the familiar fallacious one that, since the mind can in prin¬ 
ciple “think” of any natural number, it must be capable of an infinity 
of distinct states and hence not be a Turing machine. This rests on the 
false assumption that we need to be in distinct single states to think of 
distinct numbers: Since “thinking of a number” in general involves sym¬ 
bolic calculation, it is best analyzed precisely as Turing does as a process 
of passing through a finite sequence of states, each carrying a bounded 
record of the history of the calculation. Indeed, in a penetrating analysis 
of Turing’s arguments, Gandy replaces his finiteness condition on states 
by a “principle of local causation” excluding any instantaneous action at 
a distance in the causal relations between successive states, and conjec¬ 
tures that Godel’s “non-mechanical intelligence would, so to speak, see 
the state rasa Gestalt , and by abstract thought make global determi¬ 
nations which could not be got at by local methods” ( Gandy 1980, page 
146). By Turing’s own lights, Godel’s intelligence would have an essen¬ 
tially omniscient memory. But, as Gandy’s formulation reminds us, it 
is the abstractness of the terms more precisely understood by Godel’s 
developing mind that is supposed to make its states different from those 
relevant to Turing’s human computer and which also distinguishes his 
argument from the familiar fallacious one for an infinity of mental states. 
When we note, furthermore, that it is the complexity of abstract terms 
that distinguishes them from concrete ones by Godel’s lights, his argu¬ 
ment can be seen as more than a mere formulation of the possibility that 
we might in the unforseeable future develop an “infinite mind” , x 

We see this in fact from Godel’s Remark 2 of 1972a on “another ver¬ 
sion” of his first theorem, where he takes it as showing that to solve 
even relatively simple problems we shall need ever new axioms of “great 
complication”. His favorite examples are the stronger axioms of infinity 


x This phrase was used by Godel in 1944 to criticize the attempt by Ramsey 
(1926) to reduce classes to infinitely long propositions. What else could an infinite 
truth-function be, asks Godel, but another infinite structure more complicated than 
classes, “endowed in addition with a hypothetical meaning, which can be understood 
only by an infinite mind”? (1944, P- 142). 
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which are “evident” even though “ever more complicated” because they 
assert the existence of increasingly abstract and complicated objects. In¬ 
deed, Godel’s argument against Turing must hinge on this assumption 
about abstract terms: that by understanding them more precisely we 
become capable of states which are themselves more and more compli¬ 
cated. In fact, immediately after his claim quoted above that we could 
effectively distinguish only finitely many of a supposed infinity of states, 
Turing had admitted that this restriction “is not one which seriously 
affects computation, since the use of more complicated states of mind 
can be avoided by writing more symbols on the tape” ( Davis 1965, page 
136). So even in Turing’s own mind the issue was more the complexity 
of the states than their actual infinity, and indeed his construction of the 
universal machine comprised the most striking confirmation of this kind 
of ‘compensation’ for the lack of complicated states. Here the ‘cybernet- 
ical aspect’ of the enumeration theorems comes to the fore: to simulate 
machines with arbitrarily more and more states which are more and 
more complicated, the universal machine, with its fixed finite number of 
states, has only to be given their Godel number on its tape, and then, 
as Turing later put it, “the complexity of the machine to be imitated is 
concentrated in the tape and does not appear in the universal machine 
proper in any way” (Hodges 1983, page 320). Godel was presumably not 
convinced, the universal machine notwithstanding, that all the states 
entered by a human computer using “finite non-mechanical procedures” 
could always be compensated for in Turing’s purely symbolic manner, 
for in such states it just might exploit the meanings of ever more abstract 
concepts of proof and infinity to grasp infinitely complicated combinato¬ 
rial relations. It is really this kind of possibility more than any conver¬ 
gence to an infinity of states that could undermine Turing’s arguments, 
but , far from having disregarded it completely, it seems that Turing him¬ 
self must have initially thought such an objection plausible; yet once he 
discovered the universal machine he saw that it could indeed compen¬ 
sate symbolically for a surprisingly wide class of increasingly complicated 
machine states. y Otherwise he would never have claimed that “a man 


y The critique of Turing’s analysis of computability in Kreisel 1972 is closely re¬ 
lated to Godel’s, with whom Kreisel agrees that “Turing’s error” of assuming that 
a human computer can enter only finitely many distinguishable states “does not 
invalidate his analysis of mechanical instructions” (p. 318). His second error, “not 
unrelated” to “the petitio principii concerning the finiteness of our thinking” already 
opposed by Cantor, is “assuming that the basic relations between (finite) codes of 
mental states must themselves be mechanical” (p. 319). That is, the succession of 
mental states in human computation cannot be described by “rudimentary” func¬ 
tions as can those of Turing machines. This suggests to Kreisel that “the sequence of 
steps needed to execute an h-effective definition” cannot be reproduced by such ma¬ 
chines. In the case of number-theoretic functions, “we can say loosely that the human 
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provided with paper, pencil, and rubber, and subject to strict discipline, 
is in effect a universal machine” (Turing 1970, page 9). 

That Turing’s mechanical model of the human computer is by no 
means unduly simple is brought out by another version of Godel’s the¬ 
orem due to Kleene: In every consistent formal system F in which the 
universal machine U can be described, there are infinitely many ex¬ 
pressible facts about IT s halting behavior which cannot be proved in F. 
Since F may have as axioms, for example, any effectively specifiable set 
of strong axioms of infinity, it would seem that U must be rather com¬ 
plicated after all, despite the seeming simplicity of its states. That this 
point was not lost on Godel is clear from a letter he wrote Arthur Burks, 
who had queried him about von Neumann’s notion of an automaton so 
complex that its behavior was “asymptotically [?] infinitely longer to 
describe” than the automaton itself. Godel replied (as quoted in von 
Neumann 1966, page 56) that 

what von Neumann perhaps had in mind appears more clearly 
from the universal Turing machine. There it might be said 
that the complete description of its behavior is infinite be¬ 
cause, in view of the non-existence of a decision procedure 
predicting its behavior, the complete description could be 
given only by an enumeration of all instances ... The univer¬ 
sal Turing machine, where the ratio of the two complexities 
is infinity, might then be considered to be a limiting case 
of other finite mechanisms. This immediately leads to von 
Neumann’s conjecture. 

He added, however, that this presupposes “the finitistic way of think¬ 
ing” about descriptions, so he was presumably still not willing to concede 
that U poses an absolutely unsolvable decision problem, even by finite 
non-mechanical procedures. But his Remark 3 of 1972a admits that 


computations are more ‘complicated’ or, better, more abstract than the objects on 
which they operate—our thoughts may be more complicated than the objects thought 
about” (ibid., p. 320). This is the same connection between abstractness and com¬ 
plexity that we find in Godel’s argument. Kreisel errs, however, when he claims “in 
the case of (Turing) machines whose states are finite spatio-temporal configurations 
it is quite clear how to code states by natural numbers” (p. 319), as if only their 
“finite spatio-temporal” character had allowed Turing to code them. But his coding 
of machine states depends only on the fact that they are finite in number and has 
nothing to do with such properties. We agree with Kreisel that the coding of purely 
mental states should be “a more delicate matter”. But the only property of such 
states he uses is that a human computer may think about proofs in Heyting’s formal 
arithmetic, and his examples of prima facie non-mechanical rules depending on such 
states have turned out to be Turing-computable. Still, Kreisel’s discussion does en¬ 
gage more explicitly than Godel’s argument the problem Turing himself saw at issue 
in this hypothesis, namely, the ‘complexity’ of states, and this is its virtue. 
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our development of such procedures is still eons away from being able to 
“actually carry out” the computations needed to predict the behavior of 
Turing’s model of the human computer. 

Finally, we note that the interpretation of Godel’s argument against 
Turing as depending on the complexity of the states entered by a hu¬ 
man computer using “finite non-mechanical procedures” does not con¬ 
flict with his claim that the adequacy of the latter’s analysis of “mechan¬ 
ical procedure” has nothing to do with the existence of such procedures. 
Still, this claim does imply a certain interpretation of his own of Tur¬ 
ing’s work, namely, that all Turing was really analyzing was the concept 
of “mechanical procedure”, but that in his arguments for the adequacy 
of his analysis he overstepped himself by dragging in the mental life of 
a human computer. As Wang’s authorized formulation of Godel’s view 
puts it, “we had not perceived the sharp concept of mechanical proce¬ 
dures sharply before Turing, who brought us to the right perspective” 
(1974, page 85). This sounds plausible enough until we look at what Tur¬ 
ing actually said. As we saw, Turing offers (in Section 9 of his paper) 
three “types of argument” for the thesis that his machines can compute 
the decimal expansion of any real number which is “calculable by finite 
means”. Godel took issue with his type I argument, though I believe 
that none of them can hold by his lights (see footnote w above). Per¬ 
haps in reflecting on this argument in 1972, Godel forgot that “the word 
‘computer’ here meant only what that word meant in 1936: a person 
doing calculations” ( Hodges 1983 , page 105). Where then did he find 
Turing’s “analysis” of mechanical operations shorn of human aspect? 
Really, nowhere: as Turing himself says, his type I argument is “only an 
elaboration of the ideas” presented in Section 1 of his paper, and his type 
III argument “may be regarded as a modification of I or as a corollary of 
II”. In fact, Turing has one basic argument, which is presented in Sec¬ 
tion 1 and discussed above, and whose central premise is “the fact that 
the human memory is necessarily limited”. Turing refers to this “fact” 
as the “justification” of his definition of the computable real numbers 
in terms of his machines. The heart of his argument was a novel ab¬ 
stract logical analysis of what it means to “effectively remember” things 
relevant to computation, such as symbols or how many times one has 
executed a subroutine: to do so one must be able to change from one dis¬ 
tinguishable state to another, whether you are human or a machine. We 
presume indeed that “states of mind” may also carry memories beyond 
the wildest dreams of machines, but the only ones relevant to effective 
computation are those you are put into by symbols and processes arising 
in the course of the computation. But our memory is just as “necessarily 
limited” as a machine’s—in either case, to a finite number of recognizable 
state changes. We saw, however, that a crucial problem does surface in 
Turing’s “elaboration” of this limitation, namely, the apparent need to 
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represent “more complicated” states as their number increases. Here a 
prima facie difference between a human computer and a Turing machine 
presented itself to Turing as soon as he wrote down the definition of the 
latter: for then Turing himself could number his machines like Godel 
and define a universal partial function which was obviously effective for 
him, whereas it could not have been obvious from his definition that one 
of his machines could compute it. Perhaps he could compute it only be¬ 
cause he entered more complicated states enabling himself to simulate 
machines with ever more complicated states. In any case, his discov¬ 
ery of the universal machine, compensating for an internal memory of 
bounded complexity by an external one of unbounded size, is really his 
main contribution to science, having already made possible a better grip 
on a basic concept which we are still far from seeing “sharply”, namely, 
that of ‘complexity’. 

We now try to focus the questions raised by Godel’s view of the sig¬ 
nificance of Turing’s analysis and his own results. The mere existence of 
undecidable sentences in formalisms like Principia mathematica, while 
interesting, would not of itself force one to reexamine the scope of for¬ 
malization in mathematics. There would simply be axioms one had 
overlooked. The full force of the results of Godel and Church is only 
made explicit in Rosser’s 1936 extension: 

(T) The set of improvable sentences of any adequate formalism 
is not r.e. 

Godel saw this as no limitation on human reason but only “pure formal¬ 
ism in mathematics”—in opposition to Post, who regarded Church’s the¬ 
sis as a hypothesis about the significance o/(T), namely, that it implied 
that the decision problem for any formalism containing such sentences 
was absolutely unsolvable by any effective method. Godel had been sure 
only of (GT) until Turing discovered the first completely deterministic, 
and hence obviously consistent, formulation of computability. But even 
when Turing computability proved equivalent to both A-definability and 
general recursiveness (and the latter to ^-recursiveness), he still insisted 
that the latter comprised the “most general recursions” of (GT) only if 
“finite procedure ” was interpreted as “mechanical procedure ”—but not 
if it referred to what a human computer could “effect”. For this, one 
needed, according to Godel, the assumption that such a computer was 
a completely physical system, an assumption he could not accept. He 
saw clearly that Timing’s analysis, if allowed to stand for humanly effec¬ 
tive processes generally, would imply that human minds are not more 
effective than machines in dealing with (T). Hence he looked for “sys¬ 
tematic methods” for developing our use of abstract notions of infinity 
and proof to the point of being able to handle such problems in construe- 
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tive but non-mechanical ways. But his example of looking for stronger 
axioms of infinity, while it may well actualize and sharpen our ability to 
use abstract concepts, is not necessarily a process that would actually 
increase the number of our distinguishable mental states. It might be 
argued that when such axioms “force themselves on us as being true”, 
we enter meaning-using states which, in some as yet undefined sense, 
are more complex than those of any machine; but this would seem to 
depend on Godel’s Platonism. His other example of “the process of sys¬ 
tematically constructing” all the recursive ordinals suggests that he may 
have contemplated the study of Turing’s ordinal logics in search of non¬ 
mechanical but effective ways of overcoming (T). Clearly he was under 
no illusions about the prospects that either approach could lead to the 
construction of a sufficiently “well-defined procedure” to refute Turing’s 
analysis. I conclude that Godel saw the difficulty of interpreting (T) 
in a way that both preserved the generality of his incompleteness theo¬ 
rems and avoided a mechanistic hypothesis to the effect that humanly 
effective processes are mechanizable. The degree of this difficulty can 
perhaps be seen by how far into the future of human development he 
felt he had to look for a way out of it. 


Judson C. Webb 2 



Some remarks on the undecidability results 
{1972a) 


1. The best and most general version of the unprovability of consistency 
in the same system. 1 Under the sole hypothesis that Z (number theory) is 
recursively one-to-one translatable into S, with demonstrability preserved 
in this direction, the consistency (in the sense of non-demonstrability of 
both a proposition and its negation), even of very strong systems S, may 
be provable in S, and even in primitive recursive number theory. However, 
what can be shown to be unprovable in S is the fact that the rules of 
the equational calculus applied to equations demonstrable in S between 
primitive recursive terms yield only correct numerical equations (provided 
that S possesses the property which is asserted to be unprovable). Note 
that it is necessary to prove this “outer” consistency of S (which for the 
usual systems is trivially equivalent with consistency) in order to “justify” 
the transfinite axioms of a system S in the sense of Hilbert’s program. 
(“Rules of the equational calculus” in the foregoing means the two rules of 
substituting primitive recursive terms for variables and of substituting one 
such term for another one to which it has been proved equal.) 

This theorem remains valid for much weaker systems than Z. With 
insignificant changes in the wording it even holds for any recursive trans¬ 
lation of the primitive recursive equations into S. 

2. Another version of the first undecidability theorem. The situation 
may be characterized by the following theorem: In order to solve all prob¬ 
lems of Goldbach type of a certain degree of complication k one needs a 
system of axioms whose degree of complication, up to a minor correction, 
is > k (where the degree of complication is measured by the number of 
symbols necessary to formulate the problem (or the system of axioms), 
of course with inclusion of the symbols occurring in the definitions of the 
non-primitive terms used). Now all of present day mathematics can be 
derived from a handful of rather simple axioms about a very few primitive 
terms. Therefore, even if only those problems are to be solvable which can 
be formulated in a few pages, the few simple axioms being used today will 
have to be supplemented by a great number of new ones or by axioms of 
great complication. It may be doubted whether evident axioms in such 
great numbers (or of such great complexity) can exist at all, and therefore 
the theorem mentioned might be taken as an indication for the existence 
of mathematical yes or no questions undecidable for the human mind. But 


1 This has already been published as a remark to footnote 1 of the translation (1967, 
p. 616) of my 1931, but perhaps it has not received sufficient notice. 
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what weighs against this interpretation is the fact that there do exist un¬ 
explored series of axioms which are analytic in the sense that they only 
explicate the content of the concepts occurring in them, e.g., the axioms 
of infinity in set theory, which assert the existence of sets of greater and 
greater cardinality or of higher and higher transfinite types and which only 
explicate the content of the general concept of set. These principles show 
that ever more (and ever more complicated) axioms appear during the de¬ 
velopment of mathematics. For, in order only to understand the axioms of 
infinity, one must first have developed set theory to a considerable extent. 

3. A philosophical error in Turing’s work. 2 Turing in his 1937, page 
250 (1965, page 136), gives an argument which is supposed to show that 
mental procedures cannot go beyond mechanical procedures. However, 
this argument is inconclusive. What Turing disregards completely is the 
fact that mind, in its use, is not static, but constantly developing, i.e., 
that we understand abstract terms more and more precisely as we go on 
using them, and that more and more abstract terms enter the sphere of 
our understanding. There may exist systematic methods of actualizing 
this development, which could form part of the procedure. Therefore, al¬ 
though at each stage the number and precision of the abstract terms at 
our disposal may be finite, both (and, therefore, also Turing’s number of 
distinguishable states of mind) may converge toward infinity in the course 
of the application of the procedure. Note that something like this indeed 
seems to happen in the process of forming stronger and stronger axioms of 
infinity in set theory. This process, however, today is far from being suffi¬ 
ciently understood to form a well-defined procedure. It must be admitted 
that the construction of a well-defined procedure which could actually be 
carried out (and would yield a non-recursive number-theoretic function) 
would require a substantial advance in our understanding of the basic con¬ 
cepts of mathematics. Another example illustrating the situation is the 
process of systematically constructing, by their distinguished sequences 
a n —> a, all recursive ordinals a of the second number-class. 


2 This remark may be regarded as a footnote to the word “mathematics” on page 73, 
line 3, of my 1964 postscript to Godel 1965. 



Introductory note to 1974 


“In this test, however, the infinitely small has completely failed.” 3. 
The test was the foundation of the differential and integral calculus; the 
author was Abraham Fraenkel (1928, page 116); the view expressed was 
the canonical one at the time. 

But infinitesimals refused to go away; by a rather circuitous route 
they have re-emerged as a part of a viable foundation for mathematical 
analysis. This was achieved by Abraham Robinson, using the construc¬ 
tion of non-standard models by logical means that had originally been 
introduced by Skolem (1933a, 1934)- In Robinson’s own words (1966, 
page vii): 

In the fall of 1960 it occurred to me that the concepts and meth¬ 
ods of contemporary Mathematical Logic are capable of providing 
a suitable framework for the development of the Differential and 
Integral Calculus by means of infinitely small and infinitely large 
numbers. I first reported my ideas in a seminar talk at Princeton 
University (November 1960) .... 

The influence of Skolem’s work was explicitly acknowledged by Robin¬ 
son. But whereas Skolem’s aim had been deeply negative, namely to 
show the limitation of axiomatic foundations, Robinson was able to turn 
the non-standard method to positive advantage by providing a new, effi¬ 
cient and rigorous technique for the use of infinitesimals in mathematical 
analysis. 

There had been previous attempts in the same direction since Leibniz’ 
time, but none achieved satisfactory levels of rigor. b Robinson succeeded 
by bringing modern logical notions and results to bear on the problem; 
by these means one can explain exactly which properties transfer from 
the standard structure of reals to the non-standard structure with in¬ 
finitely large and infinitely small numbers. The non-standard extension 
he constructed is, first of all, an elementary extension in the sense of 
model theory and, secondly, one rich in points in the sense that any (in¬ 
ternal) family of sets with the finite intersection property has a common 
point (that is, a point belonging to all the sets of the family). 

The first of these properties is called the transfer principle, and it 


a “Bei dieser Probe hat aber das Unendlichkleine restlos versagt.” 
b The history of attempts at a theory of infinitesimals from Leibniz to Robinson 
cannot be dealt with in this brief note; for some reviews of it, see Robinson 1966 and 
Laugwitz 1978. 
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guarantees that a non-standard extension has the same “algebra” as the 
reals. The second property is called the concurrence principle or sat¬ 
uration-, it is a very important uniformity principle, which lies behind 
many mathematical arguments, and has the form of a transition from a 
statement with quantifier structure V3 (to express the “local” property 
of finite intersection) to one of the form 3V (to express the “global”, or 
uniform, property). This is a principle which is at the heart of many 
finiteness, compactness and uniform-boundedness arguments. 

Godel forcefully expressed his views on the importance of Robinson’s 
work in some remarks following a talk by' Robinson at the Institute for 
Advanced Study in March 1973. Godel’s statement on that occasion was 
reproduced in Robinson 1974, page x, with Godel’s permission, and has 
been extracted here as Godel 1974 • In these remarks Godel noted that 
non-standard analysis “frequently simplifies substantially the proofs”, 
that it is not a “fad of mathematical logicians”, and further that “there 
are good reasons to believe that non-standard analysis, in some version 
or other, will be the analysis of the future”. 

Godel further remarked that it is “a great oddity” that the “natural 
step after the reals, namely the introduction of infinitesimals, has simply 
been omitted”. He linked this to another “oddity” of modern mathe¬ 
matics, “namely the fact that such problems as Fermat’s, which can be 
written down in ten symbols of elementary arithmetic, are still unsolved 
300 years after they have been posed”. He saw a reason for this failure 
in the enormous concentration on the development of abstract mathe¬ 
matics, while work on concrete numerical problems was neglected. 

At first sight one may indeed wonder why “the next quite natural 
step” after the reals was not taken sooner. From any one of several 
points of view, the status of the non-standard system of (hyper)reals 
may be considered to be on a par with that of the standard system of 
real numbers; the case for that was put as follows in Robinson 1966 
(page 282): 

Whatever our outlook and in spite of Leibniz’ position, it appears 
to us today that the infinitely small and infinitely large numbers 
of a non-standard model of Analysis are neither more nor less 
real than, for example, the standard irrational numbers. This 
is obvious if we introduce such numbers axiomatically; while in 
the genetic approach both standard irrational numbers and non¬ 
standard numbers are introduced by certain infinitary processes. 
This remark is equally true if we approach the problem from the 
point of view of the empirical scientist. For all measurements are 
recorded in terms of integers or rational numbers, and if our the¬ 
oretical framework goes beyond these then there is no compelling 
reason why we should stay within an Archimedean number system. 
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Godel’s remarks suggest that he would not have disagreed with this 
statement of Robinson’s. But their views would probably have diverged 
on the question of the ontological status of the “new” numbers. For, 
Robinson—who on several occasions expressed a strong formalist con¬ 
viction (see also his 1965 and 1975) —goes on to say in 1966: “From a 
formalist point of view we may look at our theory syntactically and may 
consider that what we have done is to introduce new deductive proce¬ 
dures rather than new mathematical entities.” 

In contrast, Godel’s remarks here and his general Platonist position 
would lead him to hold that there is no ontological difference between 
the integers, the rationals, the standard irrationals, and the infinitesi¬ 
mals: by a series of “quite natural steps” we may become familiar with 
and gain insight into what already exists. 

Beyond these obvious points of agreement and disagreement, Godel’s 
remarks contain some provocative statements and phrases, which de¬ 
serve to be singled out for special comment. 

11 The next quite natural step”: The use of the definite article suggests 
that adding infinitesimals results in a unique extended number system. 
This is not so in Robinson’s non-standard analysis. One may enforce 
uniqueness by somewhat arbitrary restrictions, for example, by requir¬ 
ing the extension to be Ki-saturated and of power Hi, but few will be 
satisfied by such a move. In fact arguments can be made that non¬ 
uniqueness is not a feature to be criticized but an opportunity to be 
exploited; see Fenstad 1985. 

"A great oddity in the history of mathematics”: Could the “next 
step” really have been taken earlier? It seems to me that the success 
of Robinson’s non-standard analysis presupposes in an essential way an 
understanding of the notions of elementary extension and concurrence. 
This is why Skolem’s work was such an important influence; perhaps 
Robinson should have acknowledged at the same time the importance 
of the works of A. Maltsev (1936 and 1941), in which compactness ar¬ 
guments (giving concurrence) are used for the first time. 

The reals and the infinitesimals were used in the early development 
of the integral and differential calculus and in its applications to the 
physical sciences. The reals were then tamed in the latter part of the 
last century (by Weierstrass, Dedekind and Cantor). Infinitesimals were 
more troublesome and were at first banned (“restlos versagt”). An in¬ 
sight in mathematical logic was necessary for their taming; so, contrary 
to Godel, it seems that the next step really had to wait its time. 

“Another oddity”: The intended meaning of Godel’s remarks is prob¬ 
lematic; for example, the recent (partial) success on the Fermat prob¬ 
lem due to G. Faltings (1983) proceeds via an “enormous develop¬ 
ment of abstract mathematics”. Perhaps Godel meant to hint at some 
incompleteness phenomenon and the necessity of new axioms to solve 
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concrete problems of mathematics. Such examples are known; but one 
should not forget that, when presented axiomatically, Robinson’s non¬ 
standard analysis is a conservative extension of the standard theory 
(.Kreisel 1969). 

“ The analysis of the future ”: Here one can add the following point 
of view in support of Godel’s “good reasons”. If we take seriously the 
idea that the informally understood geometric line can support point 
sets richer than the standard reals and that one non-standard extension 
is but one way of “constructing” points in this extended continuum, 
then we have in hand a framework for a geometric analysis of physical 
phenomena on many, even infinitesimal, scales, including physical phe¬ 
nomena that are too singular to fit in a direct way into the standard 
frame. In such a geometric analysis, infinitesimals appear not merely 
as a convenience in “simplifying proofs” but as an essential notion in 
the very description of the phenomena in question. There is a growing 
body of new results in the Robinson non-standard analysis exploiting 
this point of view; see Cutland 1983 and Albeverio et alii 1986 for some 
representative samples. 


Jens Erik Fenstad 



I Remark on non-standard analysis 1 

(1974 ) 


I would like to point out a fact that was not explicitly mentioned by 
Professor Robinson, but seems quite important to me; namely that non¬ 
standard analysis frequently simplifies substantially the proofs, not only 
of elementary theorems, but also of deep results. This is true, e.g., also 
for the proof of the existence of invariant subspaces for compact operators, 
disregarding the improvement of the result; and it is true in an even higher 
degree in other cases. This state of affairs should prevent a rather common 
misinterpretation of non-standard analysis, namely the idea that it is some 
kind of extravagance or fad of mathematical logicians. Nothing could be 
farther from the truth. Rather, there are good reasons to believe that 
non-standard analysis, in some version or other, will be the analysis of the 
future. 

One reason is the just mentioned simplification of proofs, since simpli¬ 
fication facilitates discovery. Another, even more convincing reason, is the 
following: Arithmetic starts with the integers and proceeds by successively 
enlarging the number system by rational and negative numbers, irrational 
numbers, etc. But the next quite natural step after the reals, namely the 
introduction of infinitesimals, has simply been omitted. I think in coming 
centuries it will be considered a great oddity in the history of mathematics 
that the first exact theory of infinitesimals was developed 300 years after 
the invention of the differential calculus. I am inclined to believe that this 
oddity has something to do with another oddity relating to the same span 
of time, namely the fact that such problems as Fermat’s, which can be 
written down in ten symbols of elementary arithmetic, are still unsolved 
300 years after they have been posed. Perhaps the omission mentioned is 
largely responsible for the fact that, compared to the enormous develop¬ 
ment of abstract mathematics, the solution of concrete numerical problems 
was left far behind. 
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Textual notes 


All of Godel’s articles printed here were previously published, except 
for 1972 and 1972a. The copy-text of each work, i.e., the version printed 
in this volume, is the first published version of the text, except in the 
case of 1940 where the 1970 printing has been used. The articles 1972 
and 1972a occur as galley proofs in Godel’s Nachlass. The copy-text of 
various papers has been emended to incorporate his later alterations, and 
these are indicated either in the textual notes below or by single square 
brackets [ ] in the text. Likewise, editorial additions or corrections are 

indicated either by textual notes or by double square brackets [ J in the 
text. (Minor editorial changes in punctuation have not been recorded in 
the textual notes.) In these notes, the pairs of numbers on the left indicate 
page and line number in the present volume. 

All articles in this volume were written in English except for 1958, which 
is printed here in German and in an English translation; see the textual 
notes under 1958 for a discussion of the translation. Abbreviations in 
English (such as Cont. Hyp., Cor., def., langu., prop, funct., math., resp., 
Th.) and in Latin (such as ad inf.) have been silently expanded throughout, 
except for p. and pp. in footnotes and for standard abbreviations (such as 
cf., e.g., etc., i.e.). 

Godel’s occasional British spelling has been changed to American spelling 
in those papers where British spelling occurs, namely 1940, 1949a and 1972. 

The original pagination for all previously published texts is indicated 
by a page number in the margin, with a vertical bar in the text indicating 
where the page begins. The first page number is always omitted. 


Godel 1938 

An offprint of this article in Godel’s Nachlass has the following correc¬ 
tion. 

Original Replaced by 

26, 5-6 Axiom III3* i.e., replacing Axiom III3* by 

Axiom III3 

Godel 1939 

Replaced by 
is the set 


Original 

27, 19 is a set 
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Godel 1939a 


Godel noted corrections in the usi 

e of a in 1939a when he published 194 7, 

and these 

have been incorporated i 

n the text; see the textual notes below 

to 1947. ' 

The two lines following Theorem 4 were mistakenly set in italics 

in the original printing, and are noi 

v set in roman. 


Original 

Replaced by 

29, 9ff 

a 


29, 28 

follows 

This follows 

32, 14 

it follows 

there follows 


Godel 1940 

The copy-text is the 1970 printing of 1940 , which includes notes that 

Godel added in 1951 and in 1965. 

To improve the readability of proofs, 

a comma 

or semi-colon has been introduced from time to time. Likewise, 

commas are inserted between different members of an ordered pair. Axiom, 
definition, lemma and theorem have been capitalized whenever they had a 

number in 1940. 



Original 

Replaced by 

35, 24 

(x){y)(3z) 

(*)(y)(3*)(«) 

38, 8 

(*)(3tf) 

(*)(3»)(«) 

39, 16 

the axiom E 

Axiom E 

39, 16 

by a * 

by * 

45, 31 

only fits to 

only applies to 

48, 13 

function (xi,...,x n ) 

function </>(xi ,..., x n ) 

49, 19 

zSy)\ 

zSy) 

53, 4-5 

hence a set 

hence ©(F f sc) is a set 

56, 32 

by 6.51, 

by 6.51, we have 

60, 20 

©(F) is an ordinal 

Now ©(F) is an ordinal 

62, 30 

P'il, 0 ) 

P‘( 7 ,0) 

62, 31 

P‘h,o) 

P‘( 7 ,0> 

62, 32 

P‘{l,o) 

P l ( 7,0) 

63, 19-20 

X is 

X is 

64, 23 

a + 1 < a 2 

a + 1 < a 2 

65, 4 

Dfn 

8.42 Dfn 

67, 28 

closure 

closure 

68, 27 

fundamental operations 

fundamental operations 

71, 21 

occuring 

occurring 

75, 15 

£[<&»* (3)], 

£[<£«>* (A)] for k = 1,2,3, 

76, 27 

notions, operations 

notions and operations 





Textual notes 315 

76, 28 

notions, operations 

notions and operations 

76, 29 

argument 

arguments 

77, 5 

cf. introduction p. 1. 

cf. page 1. 

77, 28 

u e X 

ue Z 

79, 14 

= .x e A 

= .xey _ 

79, 16 

= . ~xe.y 

=. ~iei 

79, 39 

A\- = 


80, 7 

£no(A) 

£no(A) 

81, 20 

applies 

applied 

82, 17 

xeT)(A) 

x e £(A) 

84, 6 

A a Z 

(A‘),X 

84, 10 

(Lomp(X) 

Comp(X) 

84, 11 

€omp;(X) 

Compj(X) 

88, 17 

different, since 

different. For 

89, 3 

m is 

Furthermore, m is 

89, 17 

F l 6' ■ F‘u a 

F‘8' ■ F“uj a 

89, 37 

m 2 . 

to 2 . Now 

91, 26 

F‘G‘a e F‘G‘a 

F‘G‘a e F'G 1 ^ 

93, 20 

a e to, by (2), Ody 

Now a e to, by (2); Od l y 

94, 11 

ue z 

So ue z 

94, 11 

u e z 

Then ue z 

95, 13 

hypothesis II 

hypothesis I 

95, 19 

symmetry reasons 

symmetry 

95, 22-: 

23 a e m-T]. 

aem-r). So 

95, 27 

symmetry reasons 

symmetry 

96, 19 

x e r 

Now x er 

96, 24-: 

25 F‘r)= J‘ 5 (/8,7> 

9= s(/3,7> 

96, 25 

F‘i = J^n') 

v' = 

100, 42 

X , Y 

x, y 


Godel 1944 

The 

first change below was made 

in 1972b, a reprinting of 1944, an d 

footnote 50 was then omitted (but is i 

•etained here). 


Original 

Replaced by 

122, 43 

-44 latest book 

latest book, An inquiry into 
meaning and truth. 

126, 16 

XI and XII 

xl and xli 

135, 12 

arbitrary 

arbitrarily 


Godel’s Nachlass contains four annotated offprints of 1944, catalogued 
there as items 040265 040268, and designated herein by the letters A D, 
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respectively. In addition, there is a single annotated page from a fifth off¬ 
print, Nachlass item 040269, designated here by the letter E; its annotations 
refer not to 1944 itself, but to Bemays’ review of it ( Bemays 1946). The 
annotations are variously in English, German, and Gabelsberger shorthand. 

In the list below, annotations to 1944 are cited by the page and line in 
this volume to which they refer. Where an annotation alters the printed 
text by insertion, deletion, or replacement of material, the original textual 
passage is reproduced, followed by a slash, the letter designation for the 
offprint bearing the annotation, a colon, and the text as altered. Other 
types of annotations are described within editorial brackets following the 
letter designation and colon. German annotations are followed by an En¬ 
glish translation enclosed within parentheses; in the German text itself, 
words transcribed from shorthand are set in slanted roman type. 

119, 6 the/D: the most general 

120, 7 thorough-going/C: thoroughgoing 

120, 21 rule/C: rules 

120, 23 symbols/C: symbols, 

120, 33 ideas/A: ideas, 

121, 37 An/D: Another 

122, fn 7 D: [At the end of the footnote Godel wrote 

in the margin “in the following] paper”] 

124, 11 assuming/D: assuming either 1. 

124, 12 or/D: or 2. 

124, 15 primarily given/D: arrived at first in the construction 
of language starting with the primitive terms of the 
language oder [(or)] initially 

124, 30 “simplicity”/D: “simplicity”, or perhaps one should rather 

say “nonselffeflexivity” [This insertion is preceded and 
followed by a question mark.] 

125, 9 appear/C: appear in v[on] Neum[ann’s] syst[em] of 

axpoms] 

125, 10 replace/D: replace only 

125, 20 terms/D: terms or terms denoting special classes or concepts 
125, 23 principles/D: principles concerning the solution of the 
paradoxes which were 

125, 24 principles/C: principles nec[essary] for avoiding the 
paradoxes 

125, 33-35 C: [Quotation marks to be deleted and the 
passage enclosed to be italicized.] 

125, fn 11 dealt with/D: dealt with in axiomatic set [theory] 

126, 3 quantifications/D: quantifications or classes 

126, 21 axiom/D: axiom (or rather with the decision to restrict 
oneself to such functions) 
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126, 

fn 

15 

127, 

2 


127, 

4 


127, 

15 

-16 

127, 

18 


127, 

18 


127, 

18 


127, 

20 


127, 

fn 

18 

128, 

13 

-15 

128, 

fn 

23 

128, 

fn 

23 

128, 

fn 

23 

129, 

3— 

4 

129, 

5 


129, 

8 


129, 

9 


129, 

14 


129, 

23 


130, 

4 


130, 

5-' 

7 

130, 

7 


130, 

10 



131, 

28 

132, 

18 

133, 

28 

136, 

17 

136, 

24- 


refer/D: refer and which is assumed when they are used 
nothing to do/C: [These words axe underlined and Godel 
has written in the right margin “? Zermelo hat jedenfalls 
etwas damit zu tun” (? In any case, Zermelo has 
something to do with that).] 
the/C: the merits [of the] 
axiom of reducibility/D: axioms of reducibility 
and of infinity 

defining outside/C: defining definitions not representable 
(expressible) in 

defining/D: defining the real nu[mber]s mentioned 
outside/C: in 

involve/D: involve the same or 
such classes u/C: such classes 
for translating ... contain it, ... fiction. 

23 /D: for its use, 23 (translating ... contain it) 

... fiction. 

One ... impossible/D: E.g. a rule for translating this 
latter conception of notions is, it is true, impossible 
for all notions, 

maintaining ... notions/D: maintaining this conception 
for all abstract notions 
or in fact/D: or 

rules ... containing/D: rules of use for 
thing/D: concept 
following/D: following tentative 
There/D: This definition is impossible because there 
classes or propositions/D: classes of a given type or 
propositions containing some entity a 
would/C: would likewise 
properties/D: properties of a given type 
the ... propositions/C: [This clause to be italicized.] 

There is no doubt/A: [These words are underlined and 
“falsch” (false) is written in the right margin.] 
contain themselves/A: [These words are underlined and 
“falsch” (false) is written in the right margin, followed 
slightly below by “wegen (a:) bedeutet nicht wie die 
Konjunktion” (since (a:) does not mean the same as 
the conjunction.)] 
element/A: elements 
in/D: in some 

, or combinations of such,/C: [This matter deleted.] 

function/D: function of integers 

this ... consistency of/A: it can be shown that 
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136, 25 
136, 26 
136, 27 

136, 28 


136, 31-32 

137, 8 


137, 36 
137, 36 37 
139, 24 

139, fn 47 

140, 33 


of Cantor’s/A: Cantor’s 
of the generalized/A: the generalized 
set/D: infinite set [The preceding word, ‘arbitrary’, 
is enclosed in brackets (to indicate deletion?)] 
subsets)/A: subsets) hold in the system of sets of 
all transfpnite] order[s] & that there these 
propositions] are compatible 
the former is, ... quite/A: the former, ... is quite 
clearly/D: in some sense [to which Godel added the 
footnote: propositions prior to their constituents 
(Wir erkennen zuerst Zahlen und dann erst verstehen 
wir Satze.) (First we recognize numbers and only then 
do we understand propositions.)] 
concept/D: well-defined concept 
for any ... arguments/D: of any object as argument 
140/C: 141 

reduced to/A: [These words are underlined and “falsch” 
(false) is written in the right margin.] 
axioms/D: true axioms 


In addition to the foregoing annotations, there are a number of more 
general remarks not tied to specific textual passages. They are grouped 
here according to the reprint on which they appear. 


Reprint B: 


The title page is covered with pencilled notes, mostly in shorthand, as 
follows: 

At the top: Die Stellen wo konstruktiv vorkommt sind hier angestrichen. 
(The places where ‘constructive’ occurs are marked herein.) [And in 
this reprint, all occurrences of the words ‘construct’, ‘constructive’, 
‘constructions’ and ‘constructivistic’ are indeed underlined.] 

Next below: Meine ph. Meinungen ausgesprochen auf: p. 127-128, 131, 
135, 137, 138-139, 140, 150-151, 152. (My philosophical views ex¬ 
pressed on pages 127-128, 131,135, 137, 138-139, 140, 150-151, 152.) 

Above the title: 4' p. 134 unten weniger wichtig: selbst in der konstruk- 
tierbaren Mathematik kann ein Begriff angewendet werden auf etwas 
daraus deSniert. 


4" p. 136 Ein gewisses vie. circle principle gilt fur Konstruktionen. 
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(4', page 134 below, less important: even in constructive mathematics 
a notion can be applied to something defined by means of it. 


4", page 136, a certain vicious-circle principle holds for constructions.) 


Below the title (as an index): 


Sense perc und Theorie in der Mathematik 
(Sense perception and theory in mathematics) 

Die beiden verschiedenen Interpretationen von Df 

p. 127-8 

(Kants “analytisch” kann so interpretiert werden) 
(The two different interpretations of definition 
(Kant’s “analytic” can be so interpreted)) 

p. 131 

Logical] intuition self contradictory 

p. 131 

Vicious Circle principle false 

p. 135 

Classes and some real objects 

Man kann von ihnen alien sprechen und das vie. 

p. 137 

circle princ. a nwenden. 

(One can speak of all of them and apply the 
vicious-circle principle.) 

p. 137-140 

Church typenfrei Th. 

(Church’s type-free theory) 

Th. der natiirlichen Zahlen nachweislich nicht 

p. 150 

analytisch im Kantschen Sinn. 

(The theory of the natural numbers [is] 
demonstrably not analytic in Kant’s sense.) 

p. 150 

Princ. Math, analytisch im allgemeinen Sinn 
(Principia mathematica [is] analytic in the 
general sense) 

Unser unvollkommenes Verstandnis der 
Grundbegriffe ist der Grund daB die Logik 

p. 151 

bisher unfruchtbar. 

(Our incomplete understanding of the 
fundamental notions is the reason that logic has 
so far not been fruitful.) 

p. 152 


On a loose slip of paper inserted between pages 132-133 of this reprint, 
Godel wrote: “bis p. 131 gelesen (wegen ph Inhalt und ob es gut 1st) 
14./XI.68” (Read to page 131 (on account of its philosophical content and 
whether it is good) 14 November 1968). 

In the left margin of page 140, opposite the underlined words “where it 
does not apply in the second form either”, Godel wrote “miiiSe das Ganze 
leseh” (would have to read all of it). 
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Reprint C: 


The following remarks are written at the top of page 125: 

“p. 140-141 Das vie. circ. pr. II for mere pluralities” sehr plausibel a us 
das eingeschrankte A ussonderungsax. implied by the cone, of set as plur. 
(iiber die a nderen Ax. wird nichts gesagt) Unterschied zwischen 2 Men- 
genbegriffe wird deutlich gemacht. (“pages 140-141: The vicious-circle 
principle II for mere pluralities” [follows] very plausibly from [the fact 
that] the restricted separation axiom [is] implied by the concept of set 
as plurality (nothing is said about the other axioms). [The] difference 
between two notions of set is made clear.) 

p. 132 Zermelo kann a Is “elaboration” der Idee von limited size betra- 
chtet werden. ([The work of) Zermelo can be regarded as [an] “elabo¬ 
ration” of the idea of limited size.) 


Reprint D: 


The following notes appear on the title page: 

gelesen bis p. 135 oben (read to top of page 135) 

p. 136 “constructive” de&niert (page 136: ‘constructive’ defined) 

The top of page 125 bears the notation: 

Bedeutung des terms “constructivistic” (meaning of the term “construc- 
tivistic”) 

A loose sheet was inserted between pages 130-131 of this reprint. The 
column on the left half of the sheet contains the following fragmentary 
remarks: 

Nach [(according to) an] antireal[istic] kind of constructivism]; i.e., 
the starting point and means of the construction] are to be exclu¬ 
sively sensual & material (e.g. symbols, their perceptual] properties] 
& relations] and the actual or imagined handling of them), not the 
element[ary] operations and int[uitions] of a new & irreducible entity 
called mind. The meaning of the term in question therefore is not ... 

I.e., the first alternative of footnote 23 applies while the second leaves 
room for irred[ucible] abstr[act] elements. 

The column on the right half of the same sheet contains these comments: 
Warum nichts von Weyl? (Why nothing by Weyl?) 

Warum kein Index ? ( Sachregister) Macht jedes Buch doppelt so wert- 
voll. (Why no (subject) index? [It] makes every book twice as valuable.) 
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p. 211 FuBnote (footnote): strictly antirealistic (i.e., nominalistic) 

The extension of this concept is equal to that of predicativity in a rather 
narrow sense, but admitting quantification in the def[initions], & there¬ 
fore ... 

On the back of the inserted sheet is the additional remark: 

since even Hilbert’s much more restricted “Finitism” does not start with 
symbols as such objects, but rather with [illegible word inserted above 
the line] a priori intuition of an idealized space & time [In this passage, 
the words “more restricted” are somewhat illegible; further down the 
page Godel wrote “the much narrower Finitism of Hilbert himself”.] 

The following appears at the bottom of page 135: 

Das stimmt nicht (rekursive Df. sind Df. auBerhalb des Systems, die a Is 
solche nicht im System sind). 

(That is not correct (recursive definitions are definitions outside the 
system, which as such are not in the system).) 

In the right margin near the bottom of page 137, is the remark: 

welche die einzige interessante Mathematik sincf' (which are the only 
interesting mathematics). 

In the left margin of the middle of page 140 is the annotation: 

? Brouwers selfreflex, und meine Beispiele oben 
(? Brouwer’s selfreflexivity and my examples above) 

Reprint E: 


This single page bears the heading “Bernays Rev. meiner Arbeit fiber 
Russell” (Bernays’ review of my paper on Russell), and contains the fol¬ 
lowing remarks: 

1. ) Misverstandnis meiner Interpret, der Typentheorie fur concepts an 

zwei Stellen (Misunderstanding, in two places, of my interpretation 
of type theory for concepts) 

2. ) “The whole of math.” muB vorausgesetzt werden. (“The whole of 

mathematics” must be presupposed.) 

3. ) Vermoge des meaning harm man a lie math. Satze a uf a = a re- 

duzieren. (By virtue of meaning, one can reduce all mathematical 
propositions to a = a.) 

4. ) Das Probl. der Beschreibvmg ist durch “Sinn” und “Bedeutung” in 

befriedigender Weise gelost. (The problem of description is solved 
in a satisfactory way by “sense” and “denotation”.) 
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5. ) Das Extens. axiom gilt nicht fur BegriSe. (The axiom of extension- 

ality does not hold for concepts.) 

6. ) Das Meiste, woran L[eibniz] dachte, ist bereits in der heutigen math. 

Logik enthalten. 

? und er deutet an: Man kann a us seinen Worten ersehen, daS was 
er sagt Unsinn ist ? (5 Jahre notig um es zu entwickeln zu einem 
powerf. inst. of reas.) 

(Most of what Leibniz thought of is already contained in today’s 
mathematical logic. 

? and he suggests: one can see from his words that what he says 
is nonsense? (5 years [will be] necessary in order to develop it into 
a powerful instrument of reason.) [The referents for ‘he’, in the 
passage enclosed within question marks, are unclear.] 

Toward the bottom of the page Godel wrote “Andere Rev. zitiert in vol. 
XI von J.S.L., p. 75”. (Other reviews cited in volume XI, page 75, of the 
Journal of symbolic logic.) 


Godel 1946 

Items marked with a single asterisk were changed to the new version in 
the Davis 1965 printing, while those marked with a pair of asterisks were 
changed to the new version in the Klibansky 1968 reprinting. 



Original 

Replaced by 

*150, 21 

different and by 

different. By 

*151, 11 

proposition 

propositions 

*151, 12 

non constructivistic 

non-constructive 

*151, 14 

infinity and it 

infinity. It 

*151, 20 

set 

set-theoretic 

*151, 29 

“mathematical definability” 

mathematical definability 

*151, 31 

it and again 

it. Again 

*151, 37 

e.g. 

i.e. 

*151, 39 

“definability in terms 

definability in terms 


of ordinals” 

of ordinals 

*151, 43 

property, i.e. by 

property, by 

**151, 43 

property, by 

property: By 

**152, 7 

sets, namely 

sets. Namely 

**152, 11 

assumed) and for 

assumed). For 

*152, 20 

question 

question of 

152, 29 

which 

who 

**152, 40 

Of course, you will 

You may 

**152, 42 

sets as described 

sets as conceived 
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**152, 44 

give nothing 

give, or are to give, nothing 

**153, 3 

any other 

any others 

**153, 5 

But, irrespective 

In conclusion I would like 
to say that, irrespective 

**153, 5-6 

this concept of 

the concept of definability 


definability 

suggested in this lecture 

**153, 7 

I think it has 

it has 

**153, 7-8 

are questions 

are two questions 

**153, 11-12 

It can be proved that 

It follows from the axiom 
of replacement that 

**153, 13 

can be at all defined 

can at all be defined 


Godel 1947 


In 1947, footnote 23, Godel made corrections to 1939a in the following 
sentence, which is omitted here since these corrections are now incorporated 
in the text of 1939a printed in the present volume: “I take this opportunity 
to correct a mistake in the notation and a misprint which occurred in the 
latter paper: in the lines 25 to 29 of page 221, 4 to 6 and 10 of page 222, 
11 to 19 of page 223, the letter a should be replaced (in all places where 
it occurs) by /z. Also, in Theorem 6 on page 222 the symbol *=’ should 
be inserted between <j> a { x ) and (fe(x').” See also the textual notes under 


Godel 1964 below. 



Original 

Replaced by 

177, 22 

the partila results 

the partial results 

178, 14 

confinality 

cofinality 

178, 39 

or continuum 

or that of the continuum 

180, 27 

suffice 

suffices 

186, 25 

confinality 

cofinality 



Godel 1949 

Godel 

used ch, sh and tg for cosh, sinh and tanh, respectively. 


Original 

Replaced by 

195, 1 

+x 

+x 0 

196, 38 

so-called 

so called 

197, 3 

u' = p-u- 9 

u' = p-u- q 

197, 3 

and 9 

and q 

197, 9 

u' = <jP - u- 9 

u' — <j 6 ■ u ■ q 

197, 9 

and g 

and q 

197, 42 

transformation 

transformations 
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Godel 1949a 


Godel’s additions to the 1955 German translation of 1949a have been 
inserted in square brackets in the text. 


Original 


203, 13-14 

203, 14 

204, 1-2 

204, 46 

205, 24 

206, 18 
207, 1 


destroy 

distinguish 

notion the mean 

my paper forthcoming in 

passed according 

that, whether 

exists), 


Replaced by 
destroys 
distinguishes 
motion the mean 
my forthcoming 
past according 
that whether 
exists) 


Godel 1952 


Throughout this article “Newtonean” has been changed to “Newtonian”, 
“Hamiltonean” to “Hamiltonian”, and “Lagrangean” to “Lagrangian”. 


Original 

212, 12 assymmetry 

214, 2 connect 


Replaced by 
asymmetry 
connects 


Godel 1958 

Since in 1972 above we produce a rather free translation of 1958, as 
revised by Gddel, here Stefan Bauer-Mengelberg and Jean van Heijenoort 
have endeavored to give as literal a translation as possible. They have es¬ 
chewed paraphrases and have rendered one word by one word (for example, 
Anschauung by intuition, and anschaulich by intuitive). 

In footnote 1 Godel refers to page 2 of Bemays 1954• He was apparently 
using an offprint with its own pagination, beginning with page 1. The 
proper reference is to page 10, the second page of the article. 

There is reason to believe that the “Zusammenfassung” was written by 
Godel, but no positive evidence has been found. The “Abstract”, repro¬ 
duced here from Dialectica, was probably translated from the “Zusammen¬ 
fassung” by an editor of Dialectica rather than by Godel, but here again 
no direct evidence exists. 

Original Replaced by 


251, 32 integral 


integer 
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Godel 1964 

During September 1966 Godel prepared two typed sheets of changes to 
1964 in anticipation of a third edition of 1947 (the paper 1964 constituting 
the second edition); he added a third sheet in October 1967. These sheets 
were found in his Nachlass and are incorporated in our text of 1964■ The 
major changes in the text are dated and enclosed there in square brackets, 
while the minor changes are indicated below by a single asterisk. On the 
other hand, a pair of asterisks below indicate errors introduced inadver¬ 
tently in the printing of 1964 but not found in 1947. The term “euclidean” 
has been changed throughout to “Euclidean” to agree with the usage in 
1947. 

Two of the major changes introduced on those sheets replaced passages 
in 1964 • The first of them substituted a new version of footnote 20, found 
in our text, for the version in 1964 as given here: 

“ 20 See Mahlo 1911 , pp. 190-200, and 1913, pp. 269-276. From Mahlo’s 
presentation of the subject, however, it does not appear that the numbers 
he defines actually exist. In recent years considerable progress has been 
made as to the axioms of infinity. In particular, some have been formulated 
that are based on principles entirely different from those of Mahlo, and 
Dana Scott has proved that one of them implies the negation of proposition 
A (mentioned on p. 266). So the consistency proof for the continuum 
hypothesis explained on p. 266 does not go through if this axiom is added. 
However, that these axioms are implied by the general concept of set in the 
same sense as Mahlo’s has not been made clear yet. See Tarski 1962, Scott 
1961, Hanf and Scott 1961. Mahlo’s axioms have been derived by Azriel 
Levy from a general principle about the system of all sets. See his 1960. 
See also Bernays 1961, where almost all set-theoretical axioms are derived 
from Levy’s principle.” 

The second major change consisted of substituting a new version of the 
postscript, printed in our text, for the version in 1964 as given below: 

“Shortly after the completion of the manuscript of this paper the ques¬ 
tion of whether Cantor’s continuum hypothesis is provable from the von 
Neumann-Bernays axioms of set theory (the axiom of choice included) was 
settled in the negative by Paul J. Cohen. A sketch of the proof will appear 
shortly in the Proceedings of the National Academy of Sciences. It turns 
out that for a wide range of H r , the equality 2 N ° = is consistent and an 
extension in the weak sense (that is, it implies no new number-theoretical 
theorems). Whether for a suitable concept of “standard” definition there 
exist definable N T not excluded by Konig’s theorem (see p. 260 above) for 
which this is not so is still an open question (of course, it must be assumed 
that the existence of the N r in question is either demonstrable or has been 
postulated).” 

Finally, there is omitted from footnote 24 of 1964 the same sentence (of 
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Textual notes 


corrections to 1939a) that was omitted from footnote 23 of 1947. See the 
textual notes above to 1947. 



Original 

Replaced by 

256, 26 

confinality 

cofinality 

**256, 31-32 

So the continum 

So the continuum 

**256, 36 

definitions pp. 

definitions on pp. 

256, 39 

or continuum or 

or that of the continuum or 

*259, 9 

for theorems about 

for theorems depending on 

**261, 8 

continum problem 

continuum problem 

262, fn 21 

in terms of ordinal numbers 

in terms of ordinal numbers 

264, 25 

confinality 

cofinality 

*267, 20 

Cantor’s 

The generalized 

*267, 23 

the continuum 




Godel 1972 



Original 

Replaced by 

271, 27 

proofsheets 

proof sheets 

273, 28 

ordinals, is 

ordinals is 

275, 29 

higher type functions 

higher-type functions 

278, 30 

Number theoretic 

Number-theoretic 

278, 36 

Outside the 

Outside of the 

279, fn 1 

insure 

ensure 

280, 29 

F,G 

F and G 


Godel 1972a 

In Godel’s galley proofs, from which 1972a is printed, he replaced, at 
305, 2, “weaker hypotheses” in his earlier version of Remark 1 (1967, 616) 
by “sole hypothesis”. 

Original Replaced by 


306, 2 
306, 27 


this 

non recursive 


the 

non-recursive 
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